OpenAI Content on InfoQ
-
OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing
OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases.
-
xAI Releases Grok Code Fast 1, a New Model for Agentic Coding
xAI introduced grok-code-fast-1, a model developed specifically for agentic coding workflows.
-
DeepSeek Releases v3.1 Model with Hybrid Reasoning Architecture
DeepSeek has released V3.1 of its large language model, introducing a hybrid architecture that combines thinking and non-thinking modes in a single system.
-
OpenAI's GPT-5 Now Generally Available on Microsoft Azure AI Foundry
Microsoft and OpenAI launched GPT-5 on Azure AI Foundry, bringing its advanced reasoning and task-specific model orchestration to enterprise customers. The model suite is used in applications such as Microsoft 365 Copilot and GitHub Copilot, and is aimed at high-quality outputs tailored for real-world scenarios.
-
OpenAI’s GPT-5 Debuts with Commoditizing Costs and Higher Scrutiny
On August 7, 2025, OpenAI launched GPT-5, introducing advanced capabilities such as a 400K-token context window and improved pricing for production use. Despite initial backlash over perceived performance issues, GPT-5 shows significant reductions in factual errors and enhances model integration through its new Agents SDK. As a result, user engagement has surged, solidifying its market influence.
-
OpenAI Releases gpt-oss-120b and gpt-oss-20b, Open-Weight Language Models for Local Deployment
OpenAI has released gpt-oss-120b and gpt-oss-20b, two open-weight language models designed for high-performance reasoning, tool use, and efficient deployment. These are the company’s first fully open-weight language models since GPT-2, and are available under the permissive Apache 2.0 license.
-
OpenAI Launches Study Mode in ChatGPT to Support Step-by-Step Learning
OpenAI has introduced Study Mode in ChatGPT, a feature intended to guide users through problems in a step-by-step manner rather than supplying immediate answers. It uses interactive prompts, structured responses, and follow-up questions to encourage active engagement and support comprehension.
-
Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools
Mistral has released Voxtral, a large language model aimed at speech recognition (ASR) applications that seek to integrate more advanced LLM-based capabilities and go beyond simple transcription. For two variants of the model, Voxtral Mini (3B) and Voxtral Small (24B), Mistral has released the weights under the Apache 2.0 license.
-
Microsoft Adds Deep Research Capability in Azure AI Foundry Agent Service
Microsoft has added Deep Research to the Azure AI Foundry Agent Service, a capability aimed at knowledge workers in complex fields. It autonomously analyzes and synthesizes web data, automating rigorous research tasks while preserving traceability and transparency. Deep Research is available in public preview.
-
Anthropic Upgrades App-Building Capabilities to Claude Artifacts
Anthropic has upgraded Claude with new app-building capabilities, allowing users to create, host, and share AI applications directly from text prompts. This functionality, known as Artifacts, enables users to build functional tools like data analyzers, flashcard generators, or study aids by simply describing their ideas.
-
OpenAI Launches o3-pro Model Focused on Reliability, Amid Mixed User Feedback
OpenAI launched o3-pro, a new version of its most advanced model aimed at delivering more reliable, thoughtful responses across complex tasks. Now available to Pro and Team users in ChatGPT and via API, o3-pro replaces the earlier o1-pro.
-
OpenAI Launches Codex Software Engineering Agent Preview
OpenAI has launched Codex, a research preview of a cloud-based software engineering agent designed to automate common development tasks such as writing code, debugging, testing, and generating pull requests. Integrated into ChatGPT, Codex runs each assignment in a secure sandbox environment preloaded with the user's codebase and configured to reflect their development setup.
-
Windsurf Launches SWE-1 Family of Models for Software Engineering
Windsurf has introduced its first set of SWE-1 models, aimed at supporting the full range of software engineering tasks, not limited to code generation. The lineup consists of three models: SWE-1, SWE-1-lite, and SWE-1-mini, each designed for specific scenarios.
-
OpenAI’s Stargate Project Aims to Build AI Infrastructure in Partner Countries Worldwide
OpenAI has announced a new initiative called "OpenAI for Countries" as part of its Stargate project, aiming to help nations develop AI infrastructure based on democratic principles. This expansion follows the company's initial $500 billion investment plan for AI infrastructure in the United States.
-
OpenAI Introduces GPT‑4.1 Family with Enhanced Performance and Long-Context Support
OpenAI has released a new family of language models—GPT‑4.1, GPT‑4.1 mini, and GPT‑4.1 nano—available via its API. The models improve on GPT‑4o and GPT‑4.5 across several technical benchmarks and introduce support for up to 1 million tokens of context.