InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Google DeepMind Introduces QuestBench to Evaluate LLMs in Solving Logic and Math Problems
Google DeepMind’s QuestBench benchmark helps in evaluating if LLMs can pinpoint the single, crucial question needed to solve logic, planning, or math problems. DeepMind team recently published an article on QuestBench which is a set of underspecified reasoning tasks solvable by asking at most one question.
-
Docker Model Runner Aims to Make it Easier to Run LLM Models Locally
Currently in preview with Docker Desktop 4.40 for macOS on Apple Silicon, Docker Model Runner allows developers to run models locally and iterate on application code using the local models- without disrupting their container-based workflows.
-
QCon London: AI Agents Can Work Together to Make Humans Better
In a well-received closing keynote at QCon London 2025, independent AI consultant Hannah Foxwell challenged the common narrative about AI making us more productive and helping us to do more, instead arguing that AI agents should be designed to eliminate mundane work for us rather than replace human jobs.
-
.NET AI Chat Web App Template Preview 2 Released with Qdrant and .NET Aspire Support
The second preview of the .NET AI Chat Web App template has been released, introducing several enhancements to streamline the development of AI-powered chat applications. This update includes support for .NET Aspire and integration with the Qdrant vector database, facilitating the creation of cloud-native applications.
-
Redis 8 Targets AI Applications with New Data Type for Vector Similarity
Redis has recently announced the addition of Vector Set, a data type designed for vector similarity and a new option for AI applications. This new data type marks the first major contribution from Salvatore Sanfilippo (aka ‘antirez’), the creator of Redis, since rejoining the company.
-
AI Continent: European Commission Outlines Strategy for Scaling AI Development
The European Commission has presented the AI Continent Action Plan, a new strategy designed to strengthen the European Union’s capacity for AI development and deployment. The plan outlines coordinated investment in infrastructure, access to high-quality data, AI adoption in strategic sectors, and support for regulatory implementation.
-
The Open-Source Version of InfluxDB 3 Reaches GA
Two years after releasing the GA version of InfluxData’s enterprise edition, their open-source version also reached that level of maturity. Conceptualised for real-time workloads and ease of running, the core version leaves aside features like long-term storage optimisations, compaction or high availability (HA), read replicas, or fine-grained access controls.
-
FastAPI-MCP: Simplifying the Integration of FastAPI with AI Agents
A new open-source library, FastAPI-MCP, is making it easier for developers to connect traditional FastAPI applications with modern AI agents through the Model Context Protocol (MCP). Designed for zero-configuration setup, FastAPI-MCP allows developers to automatically expose their API endpoints as MCP-compatible tools.
-
Google’s Cybersecurity Model Sec-Gemini Enables SecOps Workflows for Root Cause and Threat Analysis
Google’s new cybersecurity model Sec-Gemini focuses on cybersecurity AI to enable SecOps workflows for root cause analysis (RCA) and threat analysis, and vulnerability impact understanding.
-
Google Open-Sources Agent2Agent Protocol for Agentic Collaboration
Google released the Agent2Agent (A2A) Protocol, an open-source specification for building AI agents that can connect with other agents that support the protocol. Google has enlisted over 50 technology partners to contribute to A2A's development.
-
QCon London: How to Design GenAI Interaction from the Company That Designed Apple’s First Mouse
During her QCon London keynote, Savannah Kunovsky, managing director of emerging technologies at IDEO, talked about how design thinking can ensure that the products we build are not only technically impactful but genuinely impactful. She also discussed how we can use Gen AI to assist people in being more connected and adapting their interactions to their users' needs.
-
Google Releases Open-Source Agent Development Kit for Multi-Agent AI Applications
At Google Cloud Next 2025, Google announced the Agent Development Kit (ADK), an open-source framework aimed at simplifying the development of intelligent, multi-agent applications. The toolkit is designed to support developers across the entire lifecycle of agentic systems — from logic design and orchestration to debugging, evaluation, and deployment.
-
Datadog Employs LLMs for Assisting with Writing Accident Postmortems
Datadog combined structured metadata from its incident management app with Slack messages to create an LLM-driven functionality assisting engineers in composing incident postmortems. While working on this solution, the company dealt with the challenges of using LLMs outside of the interactive dialog systems and ensuring that high-quality content was produced.
-
Anthropic's "AI Microscope" Explores the Inner Workings of Large Language Models
Two recent papers from Anthropic attempt to shed light on the processes that take place within a large language model, exploring how to locate interpretable concepts and link them to the computational "circuits" that translate them into language, and how to characterize crucial behaviors of Claude Haiku 3.5, including hallucinations, planning, and other key traits.
-
Claude for Education: Anthropic’s AI Assistant Goes to University
Anthropic has announced the launch of Claude for Education, a specialized version of its AI assistant, Claude, developed specifically for colleges and universities. The initiative aims to support students, faculty, and administrators with secure and responsible AI integration across academics and campus operations.