InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Vercel Ship AI 2025 Key Announcements and Technical Updates
Vercel Ship AI 2025 unveiled AI SDK 6 beta, new Marketplace agents and services, workflow support for TypeScript, Vercel Agent for code reviews, Python SDK for FastAPI/Flask, and open-source templates for lead enrichment and Slack-SQL queries.
-
OpenAI Launches ChatGPT Atlas, a Browser with ChatGPT Built in
OpenAI has introduced ChatGPT Atlas, a new web browser that integrates ChatGPT into the browsing experience. Rather than functioning as a separate assistant that users have to switch to, Atlas incorporates the model throughout the browsing process. It understands web pages, answers questions, and assists with tasks in real-time.
-
Anthropic Introduces Skills for Custom Claude Tasks
Anthropic has unveiled a new feature called Skills, designed to let developers extend Claude with modular, reusable task components.
-
PyTorch Monarch Simplifies Distributed AI Workflows with a Single-Controller Model
Meta’s PyTorch team has launched Monarch, a framework that simplifies distributed AI workflows across multiple GPUs and machines. It uses a single-controller model to manage computations across a cluster, making large-scale training and reinforcement learning tasks easier while allowing developers to keep their standard PyTorch coding practices.
-
DeepSeek AI Unveils DeepSeek-OCR: Vision-Based Context Compression Redefines Long-Text Processing
DeepSeek AI has developed DeepSeek-OCR, an open-source system that uses optical 2D mapping to compress long text passages. This approach aims to improve how large language models (LLMs) handle text-heavy inputs.
-
Google Research Open-Sources the Coral NPU Platform to Help Build AI into Wearables and Edge Devices
Coral NPU is an open-source full-stack platform designed to help hardware engineers and AI developers overcome the limitations that prevent integrating AI in wearables and edge devices, including performance, fragmentation, and user trust.
-
Google Introduces LLM-Evalkit to Bring Order and Metrics to Prompt Engineering
Google has introduced LLM-Evalkit, an open-source framework built on Vertex AI SDKs, designed to make prompt engineering for large language models less chaotic and more measurable. The lightweight tool aims to replace scattered documents and guess-based iteration with a unified, data-driven workflow.
-
Microsoft Adds Agent Mode and Office Agent to Office Applications
Microsoft has expanded its Microsoft 365 Copilot platform with Agent Mode and Office Agent. The update moves Copilot beyond a conversational assistant into a system capable of running continuous, multi-step workflows across Microsoft 365 applications.
-
Researchers Introduce ACE, a Framework for Self-Improving LLM Contexts
Researchers from Stanford University, SambaNova Systems, and UC Berkeley have proposed Agentic Context Engineering (ACE), a new framework designed to improve large language models (LLMs) through evolving, structured contexts rather than weight updates. The method, described in a paper, seeks to make language models self-improving without retraining.
-
Google’s Open Source Gemini CLI Extensions Let Developers Build Custom AI-Powered Workflows
Google's Gemini CLI Extensions launch an open-source framework empowering developers to create and share integrations effortlessly. With modular architecture and playbooks for seamless tool interaction, Gemini CLI becomes a central hub for AI-assisted workflows. The platform fosters collaboration with prominent partners, enabling a robust ecosystem for personalized developer tools.
-
AWS Launches Amazon Quick Suite, an Agentic AI Workspace
AWS has launched Amazon Quick Suite, a new AI-powered workspace designed to connect company data, automate workflows, and perform actions across business applications.
-
Hugging Face Introduces RTEB, a New Benchmark for Evaluating Retrieval Models
Hugging Face unveils the Retrieval Embedding Benchmark (RTEB), a pioneering framework to assess embedding models' real-world retrieval accuracy. By merging public and private datasets, RTEB narrows the "generalization gap," ensuring models perform reliably across critical sectors. Now live and inviting collaboration, RTEB aims to set a community standard in AI retrieval evaluation.
-
10 AI-Related Standout Sessions at QCon San Francisco 2025
Join us at QCon San Francisco 2025 (Nov 17–21) for a three-day deep dive into the future of software development, exploring AI’s transformative impact. As a program committee member, I’m excited to showcase tracks that tackle real-world challenges, featuring industry leaders and sessions on AI, LLMs, and engineering mindsets. Don’t miss out!
-
Paper2Agent Converts Scientific Papers into Interactive AI Agents
Stanford's Paper2Agent framework revolutionizes research by transforming static papers into interactive AI agents that execute analyses and respond to queries. Leveraging the Model Context Protocol, it simplifies reproducibility and enhances accessibility, empowering users with dynamic, autonomous tools for deeper scientific exploration and understanding.
-
Genkit Extension for Gemini CLI Brings Framework-Aware AI Assistance to the Terminal
Introducing Google's Genkit Extension for Gemini CLI: a groundbreaking tool that delivers framework-aware AI assistance directly to the terminal. Streamline your Genkit application development with context-aware code generation, debugging, and best practices—all without leaving the command line. Unleash productivity and innovation in building generative AI applications.