AI, ML & Data Engineering Content on InfoQ
-
OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing
OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases.
-
Hugging Face Introduces AI Sheets, a No-Code Tool for Dataset Transformation
Hugging Face has released AI Sheets, an open-source application designed to let users build, transform, and enrich datasets using AI models through a spreadsheet-like interface. The tool, available both on the Hub and for local deployment, allows users to experiment with thousands of open models, including OpenAI’s gpt-oss, without requiring code.
-
FerretDB Cloud: Open Source Alternative to MongoDB Atlas?
FerretDB has announced the availability of FerretDB Cloud, a managed MongoDB-compatible database service built on open source DocumentDB. The service is pitched as the first cross-cloud DocumentDB-based offering and an alternative to MongoDB Atlas, although it is currently available on AWS only.
-
xAI Releases Grok Code Fast 1, a New Model for Agentic Coding
xAI introduced grok-code-fast-1, a model developed specifically for agentic coding workflows.
-
Android Studio Narwhal Extends Gemini AI Capabilities
The latest Android Studio Narwhal 3 Feature Drop introduces enhancements aimed at boosting developer productivity, including support for resizable Compose previews, new app Backup & Restore tools, and expanded Gemini capabilities such as automatic code generation from UI screenshots.
-
InfoQ Dev Summit Munich 2025: Master the 'How' with Deep-Dive, Practitioner-Led Guidance
At InfoQ Dev Summit Munich (Oct 15-16), learn directly from the senior engineers building complex systems. This practitioner-led conference offers deep dives on real-world implementation patterns from software leaders at Allianz, Skyscanner, Zalando, and Delivery Hero.
-
Google Launches Gemini 2.5 Flash Image with Advanced Editing and Consistency Features
Google released Gemini 2.5 Flash Image (nicknamed nano-banana), its newest image generation and editing model. The system introduces several upgrades over earlier Flash models, including character consistency across prompts, multi-image fusion, precise prompt-based editing, and integration of world knowledge for semantic understanding.
-
DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture
DeepSeek has released version V3.1 of its large language model, introducing a hybrid architecture that combines thinking and non-thinking modes in a single system.
-
Hugging Face Releases Trackio, a Lightweight Open-Source Experiment Tracking Library
Hugging Face has introduced Trackio, a new open-source Python library for experiment tracking designed to be lightweight, transparent, and easy to integrate. Built as a drop-in replacement for Weights & Biases (wandb), Trackio offers local dashboards by default and seamless syncing with Hugging Face Spaces for sharing and collaboration.
-
Agentic AI Expands into SecOps to Ease Human Workloads
Agentic AI is beginning to reshape malware detection and broader security operations. These systems are being used not to replace humans, but to take on the lower-value tasks that have historically tied up analysts, from triaging alerts to reverse-engineering suspicious files.
-
IBM and NASA Trained the First Foundational Model for Heliophysics
Surya is a 366M-parameter model created by IBM and NASA to forecast solar activity, including flare events, solar winds, and precursors to solar eruptions. Such activity can significantly affect astronaut safety in space as well as terrestrial systems such as communications and power distribution.
-
Pogocache: Open Source Caching Software with Low Latency and Multiple Wire Protocols
Pogocache, a new piece of open-source caching software, recently reached 1.0 general availability, with a focus on low latency and CPU efficiency. It supports multiple popular wire protocols and claims better throughput and lower latency than other open-source caching alternatives.
-
AWS Launches Memory-Optimized EC2 R8i and R8i-flex Instances with Custom Intel Xeon 6 Processors
AWS has launched its eighth-generation Amazon EC2 R8i and R8i-flex instances, powered by custom Intel Xeon 6 processors. Designed for memory-intensive workloads, these instances offer up to 15% better price performance and enhanced memory throughput, making them ideal for real-time data processing and AI applications.
-
Anthropic’s Claude Opus 4.1 Improves Refactoring and Safety, Scores 74.5% SWE-bench Verified
Anthropic has launched Claude Opus 4.1, an update to Opus 4 that strengthens Claude's reliability as a coding assistant, particularly in multi-file projects, and improves reasoning across long interactions. The model also raised its SWE-bench Verified score to 74.5%, up from 72.5%.
-
AGENTS.md Emerges as Open Standard for AI Coding Agents
A new convention is emerging in the open-source ecosystem: AGENTS.md, a straightforward and open format designed to assist AI coding agents in software development. Already adopted by more than 20,000 repositories on GitHub, the format is being positioned as a companion to traditional documentation, offering machine-readable context that complements human-facing files like README.md.
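As a rough illustration of how a coding agent might pick up this machine-readable context, the sketch below walks upward from a file being edited and returns the closest AGENTS.md it finds. The function name `find_agents_md` and the "nearest file wins" lookup rule are assumptions made for this example; the summary above does not specify how agents locate the file.

```python
from pathlib import Path
from typing import Optional


def find_agents_md(start: Path, repo_root: Path) -> Optional[Path]:
    """Return the closest AGENTS.md at or above `start`, stopping at `repo_root`.

    Hypothetical helper: the upward search is an assumed lookup rule,
    not something stated in the article summary.
    """
    start = start.resolve()
    repo_root = repo_root.resolve()
    # Begin in the directory that contains the file (or the directory itself).
    current = start if start.is_dir() else start.parent
    while True:
        candidate = current / "AGENTS.md"
        if candidate.is_file():
            return candidate
        # Stop at the repository root, or at the filesystem root as a safety net.
        if current == repo_root or current == current.parent:
            return None
        current = current.parent
```

Under these assumptions, an agent editing `pkg/sub/mod.py` would read `pkg/AGENTS.md` if it exists and otherwise fall back to the file at the repository root.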