AI, ML & Data Engineering Content on InfoQ
-
Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools
Mistral has released Voxtral, a large language model aimed at automatic speech recognition (ASR) applications that go beyond simple transcription by integrating more advanced LLM-based capabilities. Mistral has released the weights of two variants, Voxtral Mini (3B) and Voxtral Small (24B), under the Apache 2.0 license.
-
OpenAI Announces Generalist ChatGPT Agent to Take on Excel, PowerPoint, and Chrome
OpenAI's ChatGPT Agent combines web browsing and summarization capabilities to handle data-oriented tasks. Developers can generate editable spreadsheets and presentations from simple prompts and use the outputs directly in productivity tools. The agent aims to improve workflow efficiency by automating complex tasks.
-
Amazon Launches Bedrock AgentCore for Enterprise AI Agent Infrastructure
Amazon announced the preview of Amazon Bedrock AgentCore, a collection of enterprise-grade services that help developers deploy and operate AI agents at scale across frameworks and foundation models. The platform addresses infrastructure challenges developers face when building production AI agents.
-
Inaugural MCP Dev Summit Charts AI Integration's Future
Developers and contributors of the Model Context Protocol (MCP) converged in San Francisco in May 2025 for their first developer summit, charting the future of this rapidly adopted open standard to enable seamless integration between LLM applications and external data sources and tools. Discussions focused on a roadmap for MCP, including critical enterprise features.
-
Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance
AWS has recently announced that Amazon S3 now supports sort and z-order compaction for Apache Iceberg tables. The new features reduce scan times and engine costs, and are available for both S3 Tables and traditional S3 buckets using AWS Glue Data Catalog optimization.
-
Google DeepMind Announces Robotics Foundation Model Gemini Robotics On-Device
Google DeepMind introduced Gemini Robotics On-Device, a vision-language-action (VLA) foundation model designed to run locally on robot hardware. The model features low-latency inference and can be fine-tuned for specific tasks with as few as 50 demonstrations.
-
Hugging Face Launches Reachy Mini Robots for Human-Robot Interaction
Hugging Face has launched its Reachy Mini robots, which are now available for order. Designed for AI developers, researchers, and enthusiasts, the robots offer a way to experiment with human-robot interaction and AI applications.
-
Microsoft Adds Deep Research Capability in Azure AI Foundry Agent Service
Microsoft has added Deep Research to Azure AI Foundry Agent Service, a capability aimed at knowledge workers in complex fields. It autonomously analyzes and synthesizes web data, automating rigorous research tasks while providing traceability and transparency. The capability is available in public preview.
-
Arm Scalable Matrix Extension 2 Coming to Android to Accelerate On-Device AI
Available in the Armv9-A architecture, Arm Scalable Matrix Extension 2 (SME2) is a set of advanced CPU instructions designed to accelerate matrix-heavy computation. The new Arm technology aims to help mobile developers run advanced AI models directly on the CPU with improved performance and efficiency, without requiring any changes to their apps.
-
Microsoft Launches Azure DevOps MCP Server in Public Preview
Microsoft has released the Azure DevOps Model Context Protocol (MCP) Server in public preview, enabling interaction between GitHub Copilot and Azure DevOps. The server allows developers to query and manage project data using natural language directly within VS Code, while project data remains secure and local.
-
Anthropic Introduces Economic Futures Program to Address the Economic Impact of AI
Anthropic has announced the launch of its Economic Futures Program, an initiative designed to address the economic impact of AI.
-
Researchers Attempt to Uncover the Origins of Creativity in Diffusion Models
In a recent paper, Stanford researchers Mason Kamb and Surya Ganguli proposed a mechanism that could underlie the creativity of diffusion models. The mathematical model they developed suggests that this creativity is a deterministic consequence of how those models use the denoising process to generate images.
-
Atlassian's 4 Million PostgreSQL Database Migration: When Standard Cloud Strategies Fail
Atlassian recently migrated 4 million Jira databases to Amazon Aurora, intending to reduce costs and improve the reliability of its Jira Cloud platform. Due to the large number of files involved and the constraints of managed services, the team developed a custom tool to orchestrate the process, as traditional cloud migration strategies were not viable.
-
LM Studio 0.3.17 Adds Model Context Protocol (MCP) Support for Tool-Integrated LLMs
LM Studio has released version 0.3.17, introducing support for the Model Context Protocol (MCP), a step forward in enabling language models to access external tools and data sources. Originally developed by Anthropic, MCP defines a standardized interface for connecting LLMs to services such as GitHub, Notion, or Stripe, enabling more powerful, contextual reasoning.
-
Gemma 3n Introduces Novel Techniques for Enhanced Mobile AI Inference
Launched in early preview last May, Gemma 3n is now officially available. It targets mobile-first, on-device AI applications, using new techniques designed to increase efficiency and improve performance, such as per-layer embeddings and transformer nesting.