InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

Architecture & Design

From Hadoop to Kubernetes: Pinterest’s Scalable Spark Architecture on AWS EKS

Pinterest revamped its data infrastructure by transitioning from a legacy Hadoop system to the Moka platform, leveraging Kubernetes and Spark on AWS EKS. This strategic shift enhances job isolation, simplifies deployment, and optimizes resource management, leading to reduced costs and improved efficiency.

Eran Stiller
on Jul 28, 2025
AI, ML & Data Engineering

Databricks Agent Bricks Automates Enterprise AI Development with TAO and ALHF Methods

Databricks introduced Agent Bricks, a new product that changes how enterprises develop domain-specific agents. The automated workflow includes generating task-specific evaluations and LLM judges for quality assessment, creating synthetic data that resembles customer data to supplement agent learning, and searching across optimization techniques to refine agent performance.

Vinod Goje
on Jul 28, 2025
AI, ML & Data Engineering

Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling

Qwen Team has announced Qwen3-Coder, a new family of agentic code models designed for long-context, multi-step programming tasks. The most capable variant, Qwen3-Coder-480B-A35B-Instruct, is a Mixture-of-Experts model with a total of 480 billion parameters and 35 billion active parameters per forward pass.

Robert Krzaczyński
on Jul 26, 2025
Cloud

AWS Introduces Vector Capabilities on Amazon S3

At the recent AWS Summit in New York City, AWS announced the preview of Amazon S3 Vectors, claiming to be the first cloud object store with native support for storing large vector datasets. The new option offers subsecond query performance, reducing the cost of storing AI-ready data compared to traditional vector databases.

Renato Losio
on Jul 26, 2025
Architecture & Design

State Space Models Can Enable AI in Low-Power Edge Computing

At the the 2025 Embedded Vision Summit, Tony Lewis, chief technology officer at BrainChip, presented research done by his company into state space models (SSMs) and how they can provide LLM capabilities with very low power consumption in limited computing environments, such as those found on dashcams, medical devices, security cameras, and even toys.

Patrick Farry
on Jul 24, 2025
AI, ML & Data Engineering

Perplexity Launches Comet: a Browser Designed around AI-Assisted Interaction

Perplexity has introduced Comet, a new web browser designed to integrate natural language interaction directly into the browsing experience. Unlike conventional browsers built around navigation and search, Comet aims to support users in research, comparison, and task execution by combining browsing with persistent context and AI assistance.

Robert Krzaczyński
on Jul 23, 2025
AI, ML & Data Engineering

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

Mistral has released Voxtral, a large language model aimed at speech recognition (ASR) applications that seek to integrate more advanced LLM-based capabilities and go beyond simple transcription. For two variants of the model, Voxtral Mini (3B) and Voxtral Small (24B), Mistral has released the weights under the Apache 2.0 license.

Sergio De Simone
on Jul 23, 2025
AI, ML & Data Engineering

OpenAI Announces Generalist ChatGPT Agent to Take on Excel, PowerPoint, and Chrome

OpenAI's ChatGPT Agent merges advanced browsing and summarization for seamless data handling. Developers can now generate editable spreadsheets and presentations with simple prompts, integrating outputs directly into productivity tools. With impressive accuracy and connectivity, it enhances workflow efficiency while automating complex tasks, heralding a new era in AI-driven productivity.

Andrew Hoblitzell
on Jul 22, 2025
AI, ML & Data Engineering

Amazon Launches Bedrock AgentCore for Enterprise AI Agent Infrastructure

Amazon announced the preview of Amazon Bedrock AgentCore, a collection of enterprise-grade services that help developers deploy and operate AI agents at scale across frameworks and foundation models. The platform addresses infrastructure challenges developers face when building production AI agents.

Vinod Goje
on Jul 22, 2025
AI, ML & Data Engineering

Inaugural MCP Dev Summit Charts AI Integration's Future

Developers and contributors of the Model Context Protocol (MCP) converged in San Francisco in May 2025 for their first developer summit, charting the future of this rapidly adopted open standard to enable seamless integration between LLM applications and external data sources and tools. Discussions focused on a roadmap for MCP, including critical enterprise features.

Hien Luu
on Jul 17, 2025
Cloud

Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance

AWS has recently announced that Amazon S3 now supports sort and z-order compaction for Apache Iceberg tables. The new features reduce scan times and engine costs, and are available for both S3 Tables and traditional S3 buckets using AWS Glue Data Catalog optimization.

Renato Losio
on Jul 16, 2025
AI, ML & Data Engineering

Google DeepMind Announces Robotics Foundation Model Gemini Robotics On-Device

Google DeepMind introduced Gemini Robotics On-Device, a vision-language-action (VLA) foundation model designed to run locally on robot hardware. The model features low-latency inference and can be fine-tuned for specific tasks with as few as 50 demonstrations.

Anthony Alford
on Jul 15, 2025
AI, ML & Data Engineering

Hugging Face Launches Reachy Mini Robots for Human-Robot Interaction

Hugging Face has launched its Reachy Mini robots, now available for order. Designed for AI developers, researchers, and enthusiasts, the robots offer an exciting opportunity to experiment with human-robot interaction and AI applications.

Daniel Dominguez
on Jul 15, 2025
Cloud

Microsoft Adds Deep Research Capability in Azure AI Foundry Agent Service

Unlock the future of research with Microsoft’s Azure AI Foundry Agent Service, featuring Deep Research—an innovative tool that empowers knowledge workers in complex fields. This advanced AI capability autonomously analyzes and synthesizes web data, automating rigorous research tasks while ensuring traceability and transparency. Sign up for the public preview today!

Steef-Jan Wiggers
on Jul 14, 2025
Mobile

Arm Scalable Matrix Extension 2 Coming to Android to Accelerate On-Device AI

Available in the Armv9-A architecture, Arm Scalable Matrix Extension 2 (SME2) is a set of advanced CPU instructions designed to accelerate matrix heavy computation. The new Arm technology aims to help mobile developers to run advanced AI models directly on CPU with improved performance and efficiency, without requiring any changes to their apps.

Sergio De Simone
on Jul 13, 2025

Newer News

Older News

InfoQ Software Architects' Newsletter

News