InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Microsoft Adds Deep Research Capability in Azure AI Foundry Agent Service
Unlock the future of research with Microsoft’s Azure AI Foundry Agent Service, featuring Deep Research—an innovative tool that empowers knowledge workers in complex fields. This advanced AI capability autonomously analyzes and synthesizes web data, automating rigorous research tasks while ensuring traceability and transparency. Sign up for the public preview today!
-
Arm Scalable Matrix Extension 2 Coming to Android to Accelerate On-Device AI
Available in the Armv9-A architecture, Arm Scalable Matrix Extension 2 (SME2) is a set of advanced CPU instructions designed to accelerate matrix heavy computation. The new Arm technology aims to help mobile developers to run advanced AI models directly on CPU with improved performance and efficiency, without requiring any changes to their apps.
-
Microsoft Launches Azure DevOps MCP Server in Public Preview
Microsoft has unveiled the Azure DevOps Model Context Provider (MCP) Server in public preview, enabling seamless interaction between GitHub Copilot and Azure DevOps. This innovative tool allows developers to query and manage project data using natural language directly within VS Code, streamlining workflows and enhancing productivity while ensuring project data remains secure and local.
-
Anthropic Introduces Economic Futures Program to Address the Economic Impact of AI
Anthropic has announced the launch of its Economic Futures Program, an initiative designed to address the economic impact of AI.
-
Researchers Attempt to Uncover the Origins of Creativity in Diffusion Models
In a recent paper, Stanford researchers Mason Kamb and Surya Ganguli proposed a mechanism that could underlie the creativity of diffusion models. The mathematical model they developed suggests that this creativity is a deterministic consequence of how those models use the denoising process to generate images.
-
Atlassian's 4 Million PostgreSQL Database Migration: When Standard Cloud Strategies Fail
Atlassian recently migrated 4 million Jira databases to Amazon Aurora, intending to reduce costs and improve the reliability of its Jira Cloud platform. Due to the large number of files involved and the constraints of managed services, the team developed a custom tool to orchestrate the process, as traditional cloud migration strategies were not viable.
-
LM Studio 0.3.17 Adds Model Context Protocol (MCP) Support for Tool-Integrated LLMs
LM Studio has released version 0.3.17, introducing support for the Model Context Protocol (MCP) — a step forward in enabling language models to access external tools and data sources. Originally developed by Anthropic, MCP defines a standardized interface for connecting LLMs to services such as GitHub, Notion, or Stripe, enabling more powerful, contextual reasoning.
-
Gemma 3n Introduces Novel Techniques for Enhanced Mobile AI Inference
Launched in early preview last May, Gemma 3n is now officially available. It targets mobile-first, on-device AI applications, using new techniques designed to increase efficiency and improve performance, such as per-layer embeddings and transformer nesting.
-
Navigating Complexity, from AI Strategy to Resilient Architecture: InfoQ Dev Summit Munich 2025
Tired of conferences that don't address your real challenges? The InfoQ Dev Summit Munich 2025 schedule is different. It's packed with sessions on the topics that keep us up at night: responsible AI adoption, leadership friction, and EU data sovereignty
-
Google Launches Gemini CLI: Open-Source Terminal AI Agent for Developers
Google has released Gemini CLI, a new open-source AI command-line interface that brings the full capabilities of its Gemini 2.5 Pro model directly into developers’ terminals. Designed for flexibility, transparency, and developer-first workflows, Gemini CLI provides high-performance, natural language AI assistance through a lightweight, locally accessible interface.
-
DevSummit Boston: Key Lessons from Shipping AI Products beyond the Hype
Phil Calçado, CEO of Outropy, shared key insights at the InfoQ Dev Summit on scaling generative AI products. He highlighted the need for effective workflows and agents in AI development, advocating for iterative approaches that leverage proven software engineering principles. His insights promise to guide teams in building resilient AI systems without reinventing the wheel.
-
Google's Agent2Agent Protocol Enters the Linux Foundation
Recently open-sourced by Google, the Agent2Agent protocol is now part of the Linux Foundation, along with its accompanying SDKs and developer tools.
-
Apple's Illusion of Thinking Paper Explores Limits of Large Reasoning Models
Apple Machine Learning Research published a paper titled "The Illusion of Thinking," which investigates the abilities of Large Reasoning Models (LRMs) on a set of puzzles. As the complexity of the puzzles increases, the researchers found that LRMs encounter a "collapse" threshold where the models reduce their reasoning effort, indicating a limit to the models' scalability.
-
Google DeepMind Unveils AlphaGenome: a Unified AI Model for High-Resolution Genome Interpretation
Google DeepMind has announced the release of AlphaGenome, a new AI model designed to predict how genetic variants affect gene regulation across the entire genome. It represents a significant advancement in computational genomics by integrating long-range sequence context with base-pair resolution in a single, general-purpose architecture.
-
Anthropic Upgrades App-Building Capabilities to Claude Artifacts
Anthropic has upgraded Claude with new app-building capabilities, allowing users to create, host, and share AI applications directly from text prompts. This functionality, known as Artifacts, enables users to build functional tools like data analyzers, flashcard generators, or study aids by simply describing their ideas.