AI, ML & Data Engineering Content on InfoQ
-
Opera Unveils Opera Neon: A New Browser Built Around AI Agents
Opera has introduced Opera Neon, a new browser that goes beyond traditional web navigation by integrating AI agents capable of interpreting user intent, performing tasks, and supporting creative workflows.
-
QCon AI New York 2025: Program Committee Announced
Meet the QCon AI New York Program Committee, senior software leaders shaping a practical AI conference for engineers building at scale.
-
Google Cloud Run Now Offers Serverless GPUs for AI and Batch Processing
Google Cloud has launched NVIDIA GPU support for Cloud Run, adding scalable GPU resources to its serverless platform. The upgrade targets AI inference and batch processing, with pay-per-second billing and automatic scaling to zero.
-
Surfing the Web at Scale: Orca Explores a Human-Guided Future for AI Agents
Researchers at UC San Diego have released Orca, an open-source system that demonstrates how large language models (LLMs) can assist users on the web—not by taking control, but by guiding interaction.
-
Anthropic Open-Sources Tool to Trace the "Thoughts" of Large Language Models
Anthropic researchers have open-sourced the tool they used to trace what goes on inside a large language model during inference. It includes a circuit-tracing Python library that can be used with any open-weights model and a frontend hosted on Neuronpedia for exploring the library's output as a graph.
-
Introducing ANS: DNS-Inspired Secure Discovery for AI Agents
The Open Worldwide Application Security Project (OWASP) has recently introduced a new standard for securely discovering AI agents. Inspired by DNS, the Agent Name Service (ANS) provides a protocol-agnostic registry mechanism that uses Public Key Infrastructure (PKI) to establish agent identity and trust.
-
Amazon Aurora DSQL Goes GA: Distributed, PostgreSQL-Compatible Serverless Database
Amazon has recently announced the general availability of Amazon Aurora DSQL, a PostgreSQL-compatible, serverless, and distributed database. The new managed service is designed to support active-active high availability and multi-region strong consistency.
-
AWS Introduces Open Source Model Context Protocol Servers for ECS, EKS, and Serverless
AWS has released open-source Model Context Protocol (MCP) servers on GitHub to support AI-assisted development in Amazon ECS, EKS, and Serverless environments. These specialized tools give developers real-time, context-specific insights for application deployment, troubleshooting, and operations.
-
Perplexity Introduces Labs for Project-Based AI Workflows
Perplexity has released Labs, a new feature for Pro subscribers designed to support more complex tasks beyond question answering. The update marks a shift from search-based interactions toward structured, multi-step workflows powered by generative AI.
-
Another Rust Rewrite: OpenAI’s Codex CLI Goes Native, Drops Node and TypeScript for Rust
OpenAI recently announced that it is rewriting its Codex CLI in Rust. The Codex CLI stack originally featured React, TypeScript, and Node. The rewrite aims for security and performance gains in addition to an improved developer experience.
-
Google Brings Gemini Nano to ML Kit with New On-Device GenAI APIs
The new GenAI APIs recently added to ML Kit enable developers to use Gemini Nano for on-device inference in Android apps, supporting features like summarization, proofreading, rewriting, and image description.
-
Anthropic Introduces Claude 4 Family and Claude Code
Anthropic released Claude Opus 4 and Sonnet 4, the newest versions of their Claude series of LLMs. Both models support extended thinking, tool use, and memory improvements, and Claude Opus 4 outperforms other LLMs on coding benchmarks.
-
Amazon Open Sources Strands Agents SDK for Building AI Agents
Amazon has released Strands Agents, an open source SDK that simplifies AI agent development through a model-driven approach. The framework enables developers to build agents by defining prompts and tool lists with minimal code.
-
Google Releases LMEval, an Open-Source Cross-Provider LLM Evaluation Tool
LMEval aims to help AI researchers and developers compare the performance of different large language models. Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.
-
Azure AI Search Unveils Agentic Retrieval for Smarter Conversational AI
Microsoft’s Azure AI Search has introduced agentic retrieval, a query engine that Microsoft says improves answer relevance in conversational AI by up to 40%. The system draws on conversation history and parallel subquery execution for knowledge retrieval. It is currently in public preview.