-
Hugging Face Releases FinePDFs: a 3-Trillion-Token Dataset Built from PDFs
Hugging Face has unveiled FinePDFs, the largest publicly available corpus built entirely from PDFs. The dataset spans 475 million documents in 1,733 languages, totaling roughly 3 trillion tokens. At 3.65 terabytes in size, FinePDFs introduces a new dimension to open training datasets by tapping into a resource long considered too complex and expensive to process.
-
Vercel Introduces AI Gateway for Multi-Model Integration
Vercel has rolled out the AI Gateway for production workloads. The service provides a single API endpoint for accessing a wide range of large language and generative models, aiming to simplify integration and management for developers.
-
Google DeepMind Launches EmbeddingGemma, an Open Model for On-Device Embeddings
Google DeepMind has introduced EmbeddingGemma, a 308M parameter open embedding model designed to run efficiently on-device. The model aims to make applications like retrieval-augmented generation (RAG), semantic search, and text classification accessible without the need for a server or internet connection.
-
Hugging Face Introduces AI Sheets, a No-Code Tool for Dataset Transformation
Hugging Face has released AI Sheets, an open-source application designed to let users build, transform, and enrich datasets using AI models through a spreadsheet-like interface. The tool, available both on the Hub and for local deployment, allows users to experiment with thousands of open models, including OpenAI’s gpt-oss, without requiring code.
-
xAI Releases Grok Code Fast 1, a New Model for Agentic Coding
xAI introduced grok-code-fast-1, a model developed specifically for agentic coding workflows.
-
DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture
DeepSeek has released version V3.1 of its large language model, introducing a hybrid architecture that combines thinking and non-thinking modes in a single system.
-
IBM and NASA Trained the First Foundational Model for Heliophysics
Surya is a 366M-parameter model created by IBM and NASA to forecast solar activity, including flare events, solar winds, and precursors to solar eruptions. Such activity can significantly affect astronaut safety in space as well as terrestrial systems such as communications and power distribution.
-
An AI-Driven Approach to Creating Effective Learning Experiences at QCon
At InfoQ Dev Summit Boston, Wes Reisz spoke about an AI-influenced certification program he led as an experiment at QCon London. The program included special events during the conference, a pre-conference breakfast where participants could learn about upcoming activities, and an AI-driven workshop immediately following the conference.
-
Anthropic’s Claude Opus 4.1 Improves Refactoring and Safety, Scores 74.5% on SWE-bench Verified
Anthropic has launched Claude Opus 4.1, an update that builds on Opus 4 to strengthen coding reliability in multi-file projects and improve reasoning across long interactions. The model also raised its SWE-bench Verified score to 74.5%, up from 72.5%.
-
AGENTS.md Emerges as Open Standard for AI Coding Agents
A new convention is emerging in the open-source ecosystem: AGENTS.md, a straightforward and open format designed to assist AI coding agents in software development. Already adopted by more than 20,000 repositories on GitHub, the format is being positioned as a companion to traditional documentation, offering machine-readable context that complements human-facing files like README.md.
-
AWS CCAPI MCP Server: Natural Language Infra
AWS has introduced the Cloud Control API (CCAPI) MCP Server, which lets developers manage infrastructure resources through natural language commands. The tool provides automated security checks, IaC template generation, and cost estimation, narrowing the gap between intent and cloud deployment.
-
Container Use: a New Tool for Isolated, Parallel Coding Agents
The Dagger team has released Container Use, an open-source tool designed to streamline how AI-based coding agents operate by giving each one its own containerized sandbox and Git worktree, enabling parallel, conflict-free workflows.
-
OpenAI's GPT-5 Now Generally Available on Microsoft Azure AI Foundry
Microsoft and OpenAI have launched GPT-5 on Azure AI Foundry, bringing advanced reasoning and task-specific model orchestration to enterprise AI. The model suite enhances applications like Microsoft 365 Copilot and GitHub Copilot, with outputs tailored for real-world scenarios.
-
Claude Code Subagents Enable Modular AI Workflows with Isolated Context
Anthropic has recently made Claude Code Subagents generally available, enabling developers to create independent, task-specific AI agents with their own context, tools, and prompts.