Artificial Intelligence Content on InfoQ
-
Temporal and OpenAI Launch AI Agent Durability with Public Preview Integration
Temporal has unveiled a public preview integration with the OpenAI Agents SDK, introducing durable execution capabilities to AI agent workflows built using OpenAI's framework.
-
Kaggle Introduces Game Arena to Benchmark AI Models in Strategic Games
Kaggle, in collaboration with Google DeepMind, has introduced Kaggle Game Arena, a platform designed to evaluate artificial intelligence models by testing their performance in strategy-based games.
-
From Black Box to Blueprint: Thoughtworks Uses Generative AI to Extract Legacy System Functionality
Thoughtworks consultants successfully harnessed generative AI to decode legacy systems lacking source code. Using Gemini 2.5 Pro, they accelerated reverse engineering, creating validated "blueprints" of functionality in just two weeks. The pilot showcased AI's potential to drastically reduce time and risk in modernizing opaque systems while balancing speed with validation.
-
Introducing the MCP Registry
The Model Context Protocol (MCP) ecosystem has gained a public registry for server discovery and a secure gateway for agent interactions. The recently launched MCP Registry and the Linux Foundation's Agentgateway project aim to streamline the management of AI tools for engineering teams, with a focus on collaboration and security.
-
Hugging Face Releases FinePDFs: a 3-Trillion-Token Dataset Built from PDFs
Hugging Face has unveiled FinePDFs, the largest publicly available corpus built entirely from PDFs. The dataset spans 475 million documents in 1,733 languages, totaling roughly 3 trillion tokens. At 3.65 terabytes in size, FinePDFs introduces a new dimension to open training datasets by tapping into a resource long considered too complex and expensive to process.
-
Vercel Introduces AI Gateway for Multi-Model Integration
Vercel has rolled out the AI Gateway for production workloads. The service provides a single API endpoint for accessing a wide range of large language and generative models, aiming to simplify integration and management for developers.
-
Google DeepMind Launches EmbeddingGemma, an Open Model for On-Device Embeddings
Google DeepMind has introduced EmbeddingGemma, a 308M parameter open embedding model designed to run efficiently on-device. The model aims to make applications like retrieval-augmented generation (RAG), semantic search, and text classification accessible without the need for a server or internet connection.
-
Hugging Face Introduces AI Sheets, a No-Code Tool for Dataset Transformation
Hugging Face has released AI Sheets, an open-source application designed to let users build, transform, and enrich datasets using AI models through a spreadsheet-like interface. The tool, available both on the Hub and for local deployment, allows users to experiment with thousands of open models, including OpenAI’s gpt-oss, without requiring code.
-
xAI Releases Grok Code Fast 1, a New Model for Agentic Coding
xAI introduced grok-code-fast-1, a model developed specifically for agentic coding workflows.
-
InfoQ Dev Summit Munich 2025: Master the 'How' with Deep-Dive, Practitioner-Led Guidance
At InfoQ Dev Summit Munich (Oct 15-16), learn directly from the senior engineers building complex systems. This practitioner-led conference offers deep dives on real-world implementation patterns from software leaders at Allianz, Skyscanner, Zalando, and Delivery Hero.
-
DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture
DeepSeek has released V3.1 of its large language model, introducing a hybrid architecture that combines thinking and non-thinking modes in a single system.
-
IBM and NASA Trained the First Foundational Model for Heliophysics
Surya is a 366M-parameter model created by IBM and NASA to forecast solar activity, including flare events, solar winds, and precursors to solar eruptions. Such activity can significantly affect astronaut safety in space as well as terrestrial systems such as communications and power distribution.
-
An AI-Driven Approach to Creating Effective Learning Experiences at QCon
At InfoQ Dev Summit Boston, Wes Reisz described an experiment he led at QCon London: an AI-influenced certification program that included special events during the conference, a pre-conference breakfast where participants could learn about upcoming activities, and an AI-driven workshop immediately after the conference.
-
Anthropic’s Claude Opus 4.1 Improves Refactoring and Safety, Scores 74.5% SWE-bench Verified
Anthropic has launched Claude Opus 4.1, an update to Opus 4 that strengthens Claude’s reliability as a coding assistant in multi-file projects and improves reasoning across long interactions. The model also raised its SWE-bench Verified score to 74.5%, up from 72.5%.
-
AGENTS.md Emerges as Open Standard for AI Coding Agents
A new convention is emerging in the open-source ecosystem: AGENTS.md, a straightforward and open format designed to assist AI coding agents in software development. Already adopted by more than 20,000 repositories on GitHub, the format is being positioned as a companion to traditional documentation, offering machine-readable context that complements human-facing files like README.md.