InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Kaggle Introduces Game Arena to Benchmark AI Models in Strategic Games
Kaggle, in collaboration with Google DeepMind, has introduced Kaggle Game Arena, a platform designed to evaluate artificial intelligence models by testing their performance in strategy-based games.
-
Introducing the MCP Registry
The Model Context Protocol (MCP) ecosystem is enhancing AI development with a public registry for server discovery and a secure gateway for agent interactions. This initiative, featuring the recently launched MCP Registry and the Linux Foundation's Agentgateway project, streamlines the management of AI tools, fostering collaboration and security for engineering teams.
-
How LinkedIn Built Enterprise Multi-Agent AI on Existing Messaging Infrastructure
LinkedIn extended its generative AI application platform to support multi-agent systems by repurposing its existing messaging infrastructure as an orchestration layer. This allowed the company to scale AI agents without building new coordination technology from scratch and achieve global availability while supporting complex multi-step workflows through agent coordination.
-
Hugging Face Releases FinePDFs: a 3-Trillion-Token Dataset Built from PDFs
Hugging Face has unveiled FinePDFs, the largest publicly available corpus built entirely from PDFs. The dataset spans 475 million documents in 1,733 languages, totaling roughly 3 trillion tokens. At 3.65 terabytes in size, FinePDFs introduces a new dimension to open training datasets by tapping into a resource long considered too complex and expensive to process.
-
Cloudflare Introduces Automated Scoring for Shadow AI Risk Assessment
During AI Week 2025, Cloudflare announced Application Confidence Scores, an automated assessment system that is designed to help organizations evaluate the safety and security of third-party AI applications at scale.
-
Vercel Introduces AI Gateway for Multi-Model Integration
Vercel has rolled out the AI Gateway for production workloads. The service provides a single API endpoint for accessing a wide range of large language and generative models, aiming to simplify integration and management for developers.
-
Google DeepMind Launches EmbeddingGemma, an Open Model for On-Device Embeddings
Google DeepMind has introduced EmbeddingGemma, a 308M parameter open embedding model designed to run efficiently on-device. The model aims to make applications like retrieval-augmented generation (RAG), semantic search, and text classification accessible without the need for a server or internet connection.
-
OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing
OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases.
-
Hugging Face Introduces AI Sheets, a No-Code Tool for Dataset Transformation
Hugging Face has released AI Sheets, an open-source application designed to let users build, transform, and enrich datasets using AI models through a spreadsheet-like interface. The tool, available both on the Hub and for local deployment, allows users to experiment with thousands of open models, including OpenAI’s gpt-oss, without requiring code.
-
FerretDB Cloud: Open Source Alternative to MongoDB Atlas?
FerretDB has recently announced the availability of FerretDB Cloud, a managed MongoDB-compatible database service built on open source DocumentDB. Targeting developers seeking the first cross-cloud DocumentDB-based solution and an alternative to MongoDB Atlas, FerretDB Cloud is currently available on AWS only.
-
xAI Releases Grok Code Fast 1, a New Model for Agentic Coding
xAI introduced grok-code-fast-1, a model developed specifically for agentic coding workflows.
-
Android Studio Narwhal Extends Gemini AI Capabilities
The latest Android Studio Narwhal 3 Feature Drop introduces enhancements aimed at boosting developer productivity, including support for resizable Compose previews, new app Backup & Restore tools, and expanded Gemini capabilities such as automatic code generation from UI screenshots.
-
InfoQ Dev Summit Munich 2025: Master the 'How' with Deep-Dive, Practitioner-Led Guidance
At InfoQ Dev Summit Munich (Oct 15-16), learn directly from the senior engineers building complex systems. This practitioner-led conference offers deep dives on real-world implementation patterns from software leaders at Allianz, Skyscanner, Zalando, and Delivery Hero.
-
Google Launches Gemini 2.5 Flash Image with Advanced Editing and Consistency Features
Google released Gemini 2.5 Flash Image (nicknamed nano-banana), its newest image generation and editing model. The system introduces several upgrades over earlier Flash models, including character consistency across prompts, multi-image fusion, precise prompt-based editing, and integration of world knowledge for semantic understanding.
-
DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture
DeepSeek has released version V3.1 of its large language model, introducing a hybrid architecture that combines thinking and non-thinking modes in a single system.