InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Amazon Aurora DSQL Goes GA: Distributed, PostgreSQL-Compatible Serverless Database
Amazon has recently announced the general availability of Amazon Aurora DSQL, a PostgreSQL-compatible, serverless, and distributed database. The new managed service is designed to support active-active high availability and multi-region strong consistency.
-
AWS Introduces Open Source Model Context Protocol Servers for ECS, EKS, and Serverless
AWS has launched open-source Model Context Protocol (MCP) servers on GitHub to supercharge AI development within Amazon ECS, EKS, and Serverless environments. These specialized tools equip developers with real-time, context-specific insights, enhancing application deployment, troubleshooting, and operational efficiency. Empower your cloud experience today!
-
Perplexity Introduces Labs for Project-Based AI Workflows
Perplexity has released Labs, a new feature for Pro subscribers designed to support more complex tasks beyond question answering. The update marks a shift from search-based interactions toward structured, multi-step workflows powered by generative AI.
-
Another Rust Rewrite: OpenAI’s Codex CLI Goes Native, Drops Node and TypeScript for Rust
OpenAI recently announced rewriting its Codex CLI in Rust. Codex CLI stack originally features React, TypeScript and Node. The rewrite seeks security and performance gains on top of improved developer experience.
-
Google Brings Gemini Nano to ML Kit with New On-Device GenAI APIs
The new GenAI APIs recently added to ML Kit enable developers to use Gemini Nano for on-device inference in Android apps, supporting features like summarization, proofreading, rewriting, and image description.
-
Anthropic Introduces Claude 4 Family and Claude Code
Anthropic released Claude Opus 4 and Sonnet 4, the newest versions of their Claude series of LLMs. Both models support extended thinking, tool use, and memory improvements, and Claude 4 Opus outperforms other LLMs on coding benchmarks.
-
Amazon Open Sources Strands Agents SDK for Building AI Agents
Amazon has released Strands Agents, an open source SDK that simplifies AI agent development through a model-driven approach. The framework enables developers to build agents by defining prompts and tool lists with minimal code.
-
Google Releases LMEval, an Open-Source Cross-Provider LLM Evaluation Tool
LMEval aims to help AI researchers and developers compare the performance of different large language models. Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.
-
Azure AI Search Unveils Agentic Retrieval for Smarter Conversational AI
Microsoft’s Azure AI Search unveils agentic retrieval, a cutting-edge query engine that enhances conversational AI answer relevance by up to 40%. This dynamic system leverages conversation history and parallel subquery execution, paving the way for sophisticated knowledge retrieval. Currently in public preview, it offers adaptive search strategies tailored for evolving enterprise needs.
-
OpenSearch 3.0 Now Generally Available, with a Focus on Vector Database Performance and Scalability
The OpenSearch Software Foundation has announced the general availability of OpenSearch 3.0, the first major release in three years and the first since the project joined the Linux Foundation. This version introduces native support for the Model Context Protocol (MCP), along with pull-based data ingestion and gRPC support, aimed at improving scalability and integration.
-
Google Releases MedGemma: Open AI Models for Medical Text and Image Analysis
Google has released MedGemma, a pair of open-source generative AI models designed to support medical text and image understanding in healthcare applications. Based on the Gemma 3 architecture, the models are available in two configurations: MedGemma 4B, a multimodal model capable of processing both images and text, and MedGemma 27B, a larger model focused solely on medical text.
-
Microsoft Announces AI Agent and Platform Updates at Build 2025
At its annual developer conference, Build 2025, Microsoft introduced a set of updates focused on expanding the role of AI agents across Windows, GitHub, Azure, and Microsoft 365.
-
Gemma 3n Available for On-Device Inference Alongside RAG and Function Calling Libraries
Google has announced that Gemma 3n is now available in preview on the new LiteRT Hugging Face community, alongside many previously released models. Gemma 3n is a multimodal small language model that supports text, image, video, and audio inputs. It also supports finetuning, customization through retrieval-augmented generation (RAG), and function calling using new AI Edge SDKs.
-
Google DeepMind Unveils AI Coding Agent AlphaEvolve
Google DeepMind published a paper describing their AlphaEvolve coding agent. AlphaEvolve uses LLMs to discover and optimize algorithms across a range of domains, including hardware design, data center operations, and AI training.
-
Apollo GraphQL Launches MCP Server: a New Gateway Between AI Agents and Enterprise APIs
Apollo GraphQL recently launched its MCP Server, enabling businesses to securely and efficiently integrate AI agents with existing APIs using GraphQL. The platform empowers teams to scale innovation and drive faster time-to-value from AI investments by reducing development overhead, improving governance, and accelerating AI feature delivery.