AI, ML & Data Engineering Content on InfoQ
-
Cloudflare Introduces Aggregations in R2 SQL for Data Analytics
Cloudflare recently announced support for aggregations in R2 SQL, the new feature that lets developers run SQL queries on data stored in R2. The addition extends R2 SQL beyond basic filtering and makes it more useful for analytical workloads without requiring a separate data warehouse.
-
AWS Hikes EC2 Capacity Block Rates by 15% in Uniform ML Pricing Adjustment
AWS has raised EC2 Capacity Block prices for ML by 15% across all regions, impacting GPU-based workloads. The uniform increase affects top-tier instances powered by NVIDIA GPUs and underscores supply chain pressures and inflation. With few alternatives available, organizations face higher costs, which makes workload optimization and cost management more pressing.
-
Mistral Releases OCR 3 with Improved Accuracy on Handwritten and Structured Documents
Mistral has released Mistral OCR 3, the latest version of its optical character recognition model, focused on higher accuracy across a wide range of document types, including handwritten notes, forms, low-quality scans, and complex tables.
-
How Agoda Unified Multiple Data Pipelines into a Single Source of Truth
Agoda recently described how it consolidated multiple independent data pipelines into a centralized Apache Spark-based platform to eliminate inconsistencies in financial data. The company implemented a multi-layered quality framework that combines automated validations, machine-learning-based anomaly detection, and data contracts, while processing millions of daily booking transactions.
-
AI-Powered Code Editor Cursor Introduces Dynamic Context Discovery to Improve Token-Efficiency
Cursor introduced a new approach to minimize the context size of requests sent to large language models. Called dynamic context discovery, this method moves away from including large amounts of static context upfront, allowing the agent to dynamically retrieve only the information it needs. This reduces token usage and limits the inclusion of potentially confusing or irrelevant details.
-
Vercel Open-Sources Bash Tool for Context Retrieval Using Local Filesystems
Vercel has open-sourced bash-tool, which provides a Bash execution engine for AI agents and lets them run filesystem-based commands to retrieve context for model prompts.
-
QCon London 2026: Practitioner-Led Tracks on Connectivity & Production AI Engineering
QCon London 2026 returns March 16–19 with 15 tracks for senior leads. Key sessions cover system integration via MCP, AI engineering, and debugging distributed systems. Explore modern security, Staff+ insights, and performance optimization through practical, peer-led discussions.
-
Solving Fragmented Mobile Analytics: Uber’s Platform-Led Approach
Uber Engineering outlines its platform-led mobile analytics redesign, standardizing event instrumentation across iOS and Android to improve cross-platform consistency, reduce engineering effort, and provide reliable insights for product and data teams.
-
Google Introduces Conductor, a Context-Driven Development Extension for Gemini CLI
Google has released Conductor, a new preview extension for Gemini CLI that introduces a structured, context-driven approach to AI-assisted software development. The extension is designed to address a common limitation of chat-based coding tools: the loss of project context across sessions.
-
Google Releases Gemma Scope 2 to Deepen Understanding of LLM Behavior
Gemma Scope 2 is a suite of tools designed to interpret the behavior of Gemma 3 models, enabling researchers to analyze emergent model behaviors, audit and debug AI agents, and devise mitigation strategies against safety issues such as jailbreaks, hallucinations, and sycophancy.
-
FACTS Benchmark Suite Introduced to Evaluate Factual Accuracy of Large Language Models
A new industry benchmark aimed at systematically evaluating the factual accuracy of LLMs has been released with the launch of the FACTS Benchmark Suite. Developed by the FACTS team in collaboration with Kaggle, the suite expands earlier work on factual grounding and introduces a broader, multi-dimensional framework for measuring how reliably language models produce factually correct responses.
-
Inside the Development Workflow of Claude Code's Creator
Claude Code's creator Boris Cherny described how he uses it at Anthropic, highlighting practices such as running parallel instances, sharing learnings, automating prompting, and rigorously verifying results to compound productivity over time.
-
NVIDIA Releases Open Models, Datasets, and Tools across AI, Robotics, and Autonomous Driving
NVIDIA has released a set of open models, datasets, and development tools covering language, agentic systems, robotics, autonomous driving, and biomedical research. The update expands several existing NVIDIA model families and makes accompanying training data and reference implementations available through GitHub, Hugging Face, and NVIDIA’s developer platforms.
-
MongoBleed Vulnerability Allows Attackers to Read Data from MongoDB's Heap Memory
MongoDB recently patched CVE-2025-14847, a vulnerability affecting multiple supported and legacy MongoDB Server versions. According to the disclosure, the flaw can be exploited remotely by unauthenticated attackers with low complexity, potentially leading to the exfiltration of sensitive data and credentials.
-
Meta Applies Mutation Testing with LLM to Improve Compliance Coverage
Meta applies large language models to mutation testing through its Automated Compliance Hardening system, generating targeted mutants and tests to improve compliance coverage, reduce overhead, and detect privacy and safety risks. The approach supports scalable, LLM-driven test generation and continuous compliance across Meta’s platforms.
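
For readers unfamiliar with mutation testing, the idea behind the item above can be sketched in a few lines. This is a generic toy illustration, not Meta's Automated Compliance Hardening system: the function `is_adult`, the hand-written mutants, and both test suites are hypothetical, and no LLM is involved. A mutant is a deliberately altered copy of the code; a test suite "kills" a mutant if it fails on it, and any mutant that survives points to a gap in the tests.

```python
import operator


def is_adult(age, cmp=operator.ge):
    # Hypothetical function under test. The comparison operator is
    # injectable so that a mutant can swap it without rewriting source.
    return cmp(age, 18)


# Hand-written mutants: each replaces ">=" with a different operator.
MUTANTS = {"gt": operator.gt, "le": operator.le, "eq": operator.eq}


def weak_suite(fn):
    # Checks only values far away from the boundary.
    return fn(30) is True and fn(5) is False


def strong_suite(fn):
    # Adds the boundary case that the weak suite misses.
    return weak_suite(fn) and fn(18) is True


def surviving_mutants(suite):
    # A mutant survives when the suite still passes against it.
    survivors = []
    for name, op in MUTANTS.items():
        mutant = lambda age, op=op: is_adult(age, cmp=op)
        if suite(mutant):
            survivors.append(name)
    return survivors


print(surviving_mutants(weak_suite))    # mutants the weak suite fails to kill
print(surviving_mutants(strong_suite))  # mutants left after adding the boundary test
```

In this sketch the ">" mutant survives the weak suite because no test exercises the boundary value 18; adding that one test kills it. Generating targeted mutants and the tests that kill them is the part the article describes Meta delegating to LLMs.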