InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Cloudflare Introduces Data Platform with Zero Egress Fees
Cloudflare has recently announced the open beta of Cloudflare Data Platform, a managed solution for ingesting, storing, and querying analytical data tables using open standards such as Apache Iceberg.
-
How Meta Is Using AI to Standardize and Cut Carbon Emissions
Meta has developed an AI-based approach to improve the quality of Scope 3 emissions estimates across its IT hardware supply chain. The method combines machine learning and generative models to classify hardware components and infer missing product carbon footprint (PCF) data.
-
PyTorch Foundation Welcomes Ray, Unveils Monarch for Simplified Distributed AI
At the 2025 PyTorch Conference, the PyTorch Foundation unveiled new initiatives to advance open AI infrastructure, introducing frameworks like Ray and PyTorch Monarch for streamlined distributed workloads. Highlighting transparency in AI development, new projects from Stanford and AI2 aim to enhance reproducibility. The foundation is solidifying its role as a central hub for scalable AI solutions.
-
Inside the Architectures Powering Modern AI Systems: QCon San Francisco 2025
Senior engineers face fast-moving AI adoption without clear patterns. QCon SF 2025 brings real-world lessons from teams at Netflix, Meta, Intuit, Anthropic & more, showing how to build reliable AI systems at scale. Early bird ends Nov 11.
-
Anthropic Expands Claude Code to Web and Mobile Platforms
Anthropic has expanded the availability of Claude Code, its AI-powered development environment, bringing it to the web and mobile platforms.
-
The Architectural Shift: AI Agents Become Execution Engines While Backends Retreat to Governance
A fundamental shift in enterprise software architecture is emerging as AI agents transition from assistive tools to operational execution engines, with traditional application backends retreating to governance and permission management roles. This transformation is accelerating across sectors, with 40% of enterprise applications expected to include autonomous agents by 2026.
-
NVIDIA Introduces OmniVinci, a Research-Only LLM for Cross-Modal Understanding
NVIDIA has introduced OmniVinci, a large language model designed to understand and reason across multiple input types — including text, vision, audio, and even robotics data. The project, developed by NVIDIA Research, aims to push machine intelligence closer to human-like perception by unifying how models interpret the world across different sensory streams.
-
Vercel Ship AI 2025 Key Announcements and Technical Updates
Vercel Ship AI 2025 unveiled AI SDK 6 beta, new Marketplace agents and services, workflow support for TypeScript, Vercel Agent for code reviews, Python SDK for FastAPI/Flask, and open-source templates for lead enrichment and Slack-SQL queries.
-
OpenAI Launches ChatGPT Atlas, a Browser with ChatGPT Built in
OpenAI has introduced ChatGPT Atlas, a new web browser that integrates ChatGPT into the browsing experience. Rather than functioning as a separate assistant that users have to switch to, Atlas incorporates the model throughout the browsing process. It understands web pages, answers questions, and assists with tasks in real-time.
-
Anthropic Introduces Skills for Custom Claude Tasks
Anthropic has unveiled a new feature called Skills, designed to let developers extend Claude with modular, reusable task components.
-
PyTorch Monarch Simplifies Distributed AI Workflows with a Single-Controller Model
Meta’s PyTorch team has launched Monarch, a framework that simplifies distributed AI workflows across multiple GPUs and machines. It uses a single-controller model to manage computations across a cluster, making large-scale training and reinforcement learning tasks easier while allowing developers to keep their standard PyTorch coding practices.
-
DeepSeek AI Unveils DeepSeek-OCR: Vision-Based Context Compression Redefines Long-Text Processing
DeepSeek AI has developed DeepSeek-OCR, an open-source system that uses optical 2D mapping to compress long text passages. This approach aims to improve how large language models (LLMs) handle text-heavy inputs.
-
Google Research Open-Sources the Coral NPU Platform to Help Build AI into Wearables and Edge Devices
Coral NPU is an open-source full-stack platform designed to help hardware engineers and AI developers overcome the limitations that prevent integrating AI in wearables and edge devices, including performance, fragmentation, and user trust.
-
Google Introduces LLM-Evalkit to Bring Order and Metrics to Prompt Engineering
Google has introduced LLM-Evalkit, an open-source framework built on Vertex AI SDKs, designed to make prompt engineering for large language models less chaotic and more measurable. The lightweight tool aims to replace scattered documents and guess-based iteration with a unified, data-driven workflow.
-
Microsoft Adds Agent Mode and Office Agent to Office Applications
Microsoft has expanded its Microsoft 365 Copilot platform with Agent Mode and Office Agent. The update moves Copilot beyond a conversational assistant into a system capable of running continuous, multi-step workflows across Microsoft 365 applications.