InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
New Rust Client Enables Building Safe, High-Performance Apps with Aerospike
Aerospike has officially released its Rust client to support high-throughput, low-latency applications interacting with its real-time NoSQL database.
-
How Cloudflare Migrated Quicksilver to Multi-Level Caching While Serving Billions of Requests
The engineering team at Cloudflare recently shared how they transitioned Quicksilver, their internal global key-value store, to a tiered caching architecture. They described their incremental journey from storing everything everywhere to adopting a distributed caching system, improving storage efficiency while preserving consistency guarantees and low-latency reads at the edge.
-
Google Launched LangExtract, a Python Library for Structured Data Extraction from Unstructured Text
Google has introduced LangExtract, an open-source Python library designed to help developers extract structured information from unstructured text using large language models such as the Gemini models.
-
OpenAI Releases gpt-oss-120b and gpt-oss-20b, Open-Weight Language Models for Local Deployment
OpenAI has released gpt-oss-120b and gpt-oss-20b, two open-weight language models designed for high-performance reasoning, tool use, and efficient deployment. These are the company’s first fully open-weight language models since GPT-2, and are available under the permissive Apache 2.0 license.
-
GLM-4.5 Launches with Strong Reasoning, Coding, and Agentic Capabilities
Zhipu AI has released GLM-4.5 and GLM-4.5-Air, two new AI models designed to handle reasoning, coding, and agent tasks within a single architecture. They use a dual-mode system to switch between complex problem-solving and faster responses, aiming to improve both accuracy and speed.
-
Microsoft Releases A2A .NET SDK for Building Collaborative AI Agents
Microsoft has released the A2A .NET SDK, a new developer toolkit that enables building AI agents capable of communicating and collaborating using the Agent2Agent (A2A) protocol. With support for both client and server roles, the SDK allows .NET-based agents to interact with others across ecosystems, regardless of the underlying technology.
-
OpenAI Launches Study Mode in ChatGPT to Support Step-by-Step Learning
OpenAI has introduced Study Mode in ChatGPT, a feature intended to guide users through problems in a step-by-step manner rather than supplying immediate answers. It uses interactive prompts, structured responses, and follow-up questions to encourage active engagement and support comprehension.
-
“A Security Nightmare”: Docker Warns of Risks in MCP Toolchains
A new blog post from Docker warns that AI-powered developer tools built on the Model Context Protocol (MCP) are introducing critical security vulnerabilities — including real-world cases of credential leaks, unauthorized file access, and remote code execution.
-
Google Labs Introduces Opal, a Visual Platform for Creating AI Mini-Apps
Google Labs has introduced Opal, an experimental no-code tool that enables users to create AI-powered mini-applications through natural language descriptions and a visual workflow editor.
-
Cloudflare Chooses PostgreSQL Extension over Specialized OLAP for 100K Row/Second Analytics
In a recent article from the engineering team behind the Zero Trust product suite, Cloudflare explains why it chose TimescaleDB over ClickHouse to add analytics and reporting capabilities to its internal platform. The author highlights the “phenomenal balance” between the simplicity of storing analytical data alongside configuration data and the performance of a specialized OLAP system.
-
Google Releases Major Firebase Studio Updates for Agentic AI Development
At Google Cloud Summit London in early July, Google revealed new capabilities in Firebase Studio that promise to enhance agentic cloud-based development: an autonomous Agent mode, native support for Model Context Protocol (MCP), and Gemini CLI integration. These updates aim to streamline agentic AI development by making AI agents more independent and seamlessly embedded in developer workflows.
-
Apple Open Sources Diffusion-Based Coding Model DiffuCoder
Apple open sourced DiffuCoder, a diffusion large language model (dLLM) fine-tuned for coding tasks. DiffuCoder is based on Qwen-2.5-Coder and outperforms other code-specific LLMs on several coding benchmarks.
-
Anthropic Proposes Transparency Framework to Safeguard Frontier AI Development
Anthropic has proposed a new transparency framework designed to address the growing need for accountability in the development of frontier AI models. This proposal focuses on the largest AI companies that are developing powerful AI models, distinguished by factors such as computing power, cost, evaluation performance, and annual R&D expenditures.
-
Apple Shares Details on Upcoming AI Foundation Models for iOS 26
In a recent tech report, Apple has provided more details on the performance and characteristics of the new Apple Intelligence Foundation Models that will be part of iOS 26, as announced at the latest WWDC 2025.
-
The White House Releases National AI Strategy Focused on Innovation, Infrastructure, and Global Lead
The White House has published America’s AI Action Plan, outlining a national strategy to enhance U.S. leadership in artificial intelligence. The plan follows President Trump’s January Executive Order 14179, which directed federal agencies to accelerate AI development and remove regulatory barriers to innovation.