AI, ML & Data Engineering Content on InfoQ

News

AI, ML & Data Engineering

OpenAI Releases GPT-5-Codex Optimized for Complex Code Refactoring and Code Reviews

OpenAI has released GPT-5-Codex, its latest model for software engineering, with a focus on code refactoring and code review. The model can operate autonomously for over 7 hours and achieves 51.3% accuracy in complex tasks. It reasons adaptively and aims to fit into developer workflows by producing high-quality, tested code while minimizing noise.

Hien Luu
on Sep 22, 2025
Architecture & Design

Datadog Launches Monocle, a Unified Rust-Powered Real-Time Metrics Engine

Datadog has launched Monocle, a new real-time time series storage engine written in Rust. The system unifies the company’s metrics storage infrastructure, delivering higher ingestion throughput and lower query latency while reducing operational complexity. Monocle replaces several generations of storage backends, addressing concurrency challenges and scaling limits that accumulated over time.

Leela Kumili
on Sep 22, 2025
AI, ML & Data Engineering

Replit Introduces Agent 3 for Extended Autonomous Coding and Automation

Replit has introduced Agent 3, its latest autonomous software agent built to extend the use of AI in programming and workflow automation. Unlike earlier coding assistants that provide small pieces of help through autocomplete or single-step code generation, Agent 3 is designed to carry out tasks over an extended period of time.

Daniel Dominguez
on Sep 22, 2025
Culture & Methods

Open Practices for Architecture and AI Adoption

At Cloud Native Summit 2025, Andrea Magnorsky presented Byte-Sized Architecture, a format for building shared understanding through small, recurring workshops. Ahilan Ponnusamy and Andreas Spanner discussed the Technology Operating Model for AI adoption. Both approaches drew on the Open Practice Library for human-centred collaboration and for driving architectural evolution.

Rafiq Gemmail
on Sep 18, 2025
AI, ML & Data Engineering

Hugging Face Brings Open-Source LLMs to GitHub Copilot Chat in VS Code

Hugging Face has introduced a new integration that allows developers to connect Inference Providers directly with GitHub Copilot Chat in Visual Studio Code. The update means that open-source large language models — including Kimi K2, DeepSeek V3.1, GLM 4.5, and others — can now be accessed and tested from inside the VS Code editor, without the need to switch platforms or juggle multiple tools.

Robert Krzaczyński
on Sep 17, 2025
AI, ML & Data Engineering

Kaggle Introduces Game Arena to Benchmark AI Models in Strategic Games

Kaggle, in collaboration with Google DeepMind, has introduced Kaggle Game Arena, a platform designed to evaluate artificial intelligence models by testing their performance in strategy-based games.

Daniel Dominguez
on Sep 16, 2025
AI, ML & Data Engineering

Introducing the MCP Registry

The Model Context Protocol (MCP) ecosystem now has a public registry for server discovery and a secure gateway for agent interactions. The initiative, which features the recently launched MCP Registry and the Linux Foundation's Agentgateway project, streamlines the management of AI tools and supports collaboration and security for engineering teams.

Andrew Hoblitzell
on Sep 15, 2025
Architecture & Design

How LinkedIn Built Enterprise Multi-Agent AI on Existing Messaging Infrastructure

LinkedIn extended its generative AI application platform to support multi-agent systems by repurposing its existing messaging infrastructure as an orchestration layer. This allowed the company to scale AI agents without building new coordination technology from scratch and achieve global availability while supporting complex multi-step workflows through agent coordination.

Eran Stiller
on Sep 15, 2025
AI, ML & Data Engineering

Hugging Face Releases FinePDFs: a 3-Trillion-Token Dataset Built from PDFs

Hugging Face has unveiled FinePDFs, the largest publicly available corpus built entirely from PDFs. The dataset spans 475 million documents in 1,733 languages, totaling roughly 3 trillion tokens. At 3.65 terabytes in size, FinePDFs introduces a new dimension to open training datasets by tapping into a resource long considered too complex and expensive to process.

Robert Krzaczyński
on Sep 15, 2025
Cloud

Cloudflare Introduces Automated Scoring for Shadow AI Risk Assessment

During AI Week 2025, Cloudflare announced Application Confidence Scores, an automated assessment system designed to help organizations evaluate the safety and security of third-party AI applications at scale.

Renato Losio
on Sep 13, 2025
AI, ML & Data Engineering

Vercel Introduces AI Gateway for Multi-Model Integration

Vercel has rolled out the AI Gateway for production workloads. The service provides a single API endpoint for accessing a wide range of large language and generative models, aiming to simplify integration and management for developers.

Daniel Dominguez
on Sep 12, 2025
AI, ML & Data Engineering

Google DeepMind Launches EmbeddingGemma, an Open Model for On-Device Embeddings

Google DeepMind has introduced EmbeddingGemma, a 308M parameter open embedding model designed to run efficiently on-device. The model aims to make applications like retrieval-augmented generation (RAG), semantic search, and text classification accessible without the need for a server or internet connection.

Robert Krzaczyński
on Sep 11, 2025
AI, ML & Data Engineering

OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing

OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases.

Hien Luu
on Sep 11, 2025
AI, ML & Data Engineering

Hugging Face Introduces AI Sheets, a No-Code Tool for Dataset Transformation

Hugging Face has released AI Sheets, an open-source application designed to let users build, transform, and enrich datasets using AI models through a spreadsheet-like interface. The tool, available both on the Hub and for local deployment, allows users to experiment with thousands of open models, including OpenAI’s gpt-oss, without requiring code.

Robert Krzaczyński
on Sep 08, 2025
Cloud

FerretDB Cloud: Open Source Alternative to MongoDB Atlas?

FerretDB has recently announced the availability of FerretDB Cloud, a managed MongoDB-compatible database service built on open source DocumentDB. Targeting developers seeking the first cross-cloud DocumentDB-based solution and an alternative to MongoDB Atlas, FerretDB Cloud is currently available on AWS only.

Renato Losio
on Sep 06, 2025
