InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

Mistral has released Voxtral, a large language model aimed at speech recognition (ASR) applications that seek to integrate more advanced LLM-based capabilities and go beyond simple transcription. For two variants of the model, Voxtral Mini (3B) and Voxtral Small (24B), Mistral has released the weights under the Apache 2.0 license.

Sergio De Simone
on Jul 23, 2025
AI, ML & Data Engineering

OpenAI Announces Generalist ChatGPT Agent to Take on Excel, PowerPoint, and Chrome

OpenAI's ChatGPT Agent merges advanced browsing and summarization for seamless data handling. Developers can now generate editable spreadsheets and presentations with simple prompts, integrating outputs directly into productivity tools. With impressive accuracy and connectivity, it enhances workflow efficiency while automating complex tasks, heralding a new era in AI-driven productivity.

Andrew Hoblitzell
on Jul 22, 2025
AI, ML & Data Engineering

Amazon Launches Bedrock AgentCore for Enterprise AI Agent Infrastructure

Amazon announced the preview of Amazon Bedrock AgentCore, a collection of enterprise-grade services that help developers deploy and operate AI agents at scale across frameworks and foundation models. The platform addresses infrastructure challenges developers face when building production AI agents.

Vinod Goje
on Jul 22, 2025
AI, ML & Data Engineering

Inaugural MCP Dev Summit Charts AI Integration's Future

Developers and contributors of the Model Context Protocol (MCP) converged in San Francisco in May 2025 for their first developer summit, charting the future of this rapidly adopted open standard to enable seamless integration between LLM applications and external data sources and tools. Discussions focused on a roadmap for MCP, including critical enterprise features.

Hien Luu
on Jul 17, 2025
Cloud

Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance

AWS has recently announced that Amazon S3 now supports sort and z-order compaction for Apache Iceberg tables. The new features reduce scan times and engine costs, and are available for both S3 Tables and traditional S3 buckets using AWS Glue Data Catalog optimization.

Renato Losio
on Jul 16, 2025
AI, ML & Data Engineering

Google DeepMind Announces Robotics Foundation Model Gemini Robotics On-Device

Google DeepMind introduced Gemini Robotics On-Device, a vision-language-action (VLA) foundation model designed to run locally on robot hardware. The model features low-latency inference and can be fine-tuned for specific tasks with as few as 50 demonstrations.

Anthony Alford
on Jul 15, 2025
AI, ML & Data Engineering

Hugging Face Launches Reachy Mini Robots for Human-Robot Interaction

Hugging Face has launched its Reachy Mini robots, now available for order. Designed for AI developers, researchers, and enthusiasts, the robots offer an exciting opportunity to experiment with human-robot interaction and AI applications.

Daniel Dominguez
on Jul 15, 2025
Cloud

Microsoft Adds Deep Research Capability in Azure AI Foundry Agent Service

Unlock the future of research with Microsoft’s Azure AI Foundry Agent Service, featuring Deep Research—an innovative tool that empowers knowledge workers in complex fields. This advanced AI capability autonomously analyzes and synthesizes web data, automating rigorous research tasks while ensuring traceability and transparency. Sign up for the public preview today!

Steef-Jan Wiggers
on Jul 14, 2025
Mobile

Arm Scalable Matrix Extension 2 Coming to Android to Accelerate On-Device AI

Available in the Armv9-A architecture, Arm Scalable Matrix Extension 2 (SME2) is a set of advanced CPU instructions designed to accelerate matrix heavy computation. The new Arm technology aims to help mobile developers to run advanced AI models directly on CPU with improved performance and efficiency, without requiring any changes to their apps.

Sergio De Simone
on Jul 13, 2025
DevOps

Microsoft Launches Azure DevOps MCP Server in Public Preview

Microsoft has unveiled the Azure DevOps Model Context Provider (MCP) Server in public preview, enabling seamless interaction between GitHub Copilot and Azure DevOps. This innovative tool allows developers to query and manage project data using natural language directly within VS Code, streamlining workflows and enhancing productivity while ensuring project data remains secure and local.

Mark Silvester
on Jul 12, 2025
AI, ML & Data Engineering

Anthropic Introduces Economic Futures Program to Address the Economic Impact of AI

Anthropic has announced the launch of its Economic Futures Program, an initiative designed to address the economic impact of AI.

Daniel Dominguez
on Jul 11, 2025
AI, ML & Data Engineering

Researchers Attempt to Uncover the Origins of Creativity in Diffusion Models

In a recent paper, Stanford researchers Mason Kamb and Surya Ganguli proposed a mechanism that could underlie the creativity of diffusion models. The mathematical model they developed suggests that this creativity is a deterministic consequence of how those models use the denoising process to generate images.

Sergio De Simone
on Jul 06, 2025
Cloud

Atlassian's 4 Million PostgreSQL Database Migration: When Standard Cloud Strategies Fail

Atlassian recently migrated 4 million Jira databases to Amazon Aurora, intending to reduce costs and improve the reliability of its Jira Cloud platform. Due to the large number of files involved and the constraints of managed services, the team developed a custom tool to orchestrate the process, as traditional cloud migration strategies were not viable.

Renato Losio
on Jul 05, 2025
AI, ML & Data Engineering

LM Studio 0.3.17 Adds Model Context Protocol (MCP) Support for Tool-Integrated LLMs

LM Studio has released version 0.3.17, introducing support for the Model Context Protocol (MCP) — a step forward in enabling language models to access external tools and data sources. Originally developed by Anthropic, MCP defines a standardized interface for connecting LLMs to services such as GitHub, Notion, or Stripe, enabling more powerful, contextual reasoning.

Robert Krzaczyński
on Jul 05, 2025
Mobile

Gemma 3n Introduces Novel Techniques for Enhanced Mobile AI Inference

Launched in early preview last May, Gemma 3n is now officially available. It targets mobile-first, on-device AI applications, using new techniques designed to increase efficiency and improve performance, such as per-layer embeddings and transformer nesting.

Sergio De Simone
on Jul 04, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News