InfoQ Homepage API Content on InfoQ
-
Qwen Team Releases Qwen3-Coder, a Large Agentic Coding Model with Open Tooling
Qwen Team has announced Qwen3-Coder, a new family of agentic code models designed for long-context, multi-step programming tasks. The most capable variant, Qwen3-Coder-480B-A35B-Instruct, is a Mixture-of-Experts model with a total of 480 billion parameters and 35 billion active parameters per forward pass.
-
Google Apigee Adds Built-in LLM Governance with Model Armor
Google Cloud has launched the public preview of Model Armor, a native LLM governance framework integrated into the Apigee API management platform. Detailed in a community post, Model Armor introduces out-of-the-box enforcement for LLM-specific policies such as prompt validation, output filtering, and token-level controls at the API layer.
-
Inside Netflix’s Title Launch Observability System: Validating Title Availability at Global Scale
Netflix has developed a platform called Title Launch Observability, which shifts observability from system health to product intent. Instead of relying solely on logs and metrics, the system validates launches against what users should see, catching content quality issues early. The platform helps detect issues such as missing artwork, incorrect recommendations, or localization gaps.
-
Google DeepMind Unveils AlphaGenome: a Unified AI Model for High-Resolution Genome Interpretation
Google DeepMind has announced the release of AlphaGenome, a new AI model designed to predict how genetic variants affect gene regulation across the entire genome. It represents a significant advancement in computational genomics by integrating long-range sequence context with base-pair resolution in a single, general-purpose architecture.
-
AWS Open-Sources Smithy API Models for Public Access and Developer Tooling
AWS has unveiled public access to its comprehensive API models, delivering daily updates and open-source resources via GitHub. By utilizing Smithy, AWS aims to empower developers to build custom SDK clients, server stubs, and innovative tools, enhancing integration and understanding of AWS APIs. Explore the potential of API development with these powerful new tools!
-
Amazon API Gateway Adds Dynamic Routing Based on Headers and Paths
AWS's new dynamic routing rules for Amazon API Gateway empower developers to streamline API traffic management by routing requests based on HTTP headers without complex URL structures. This innovative feature simplifies API versioning, enables fine-grained control, enhances A/B testing, and improves request visibility, making API configurations more efficient and user-friendly.
-
FastAPI-MCP: Simplifying the Integration of FastAPI with AI Agents
A new open-source library, FastAPI-MCP, is making it easier for developers to connect traditional FastAPI applications with modern AI agents through the Model Context Protocol (MCP). Designed for zero-configuration setup, FastAPI-MCP allows developers to automatically expose their API endpoints as MCP-compatible tools.
-
OpenAI Announces ‘o3’ Reasoning Model
OpenAI has launched the O3 and O3 Mini models, setting a new standard in AI with enhanced reasoning capabilities. Notable achievements include 71.7% accuracy on SWE-Bench and 96.7% on the AIME benchmark. While these models excel in coding and mathematics, challenges remain. O3 Mini offers scalable options for developers, prioritizing safety and adaptability.
-
OpenAI Releases Sora and Full Version of O1 Reasoning Model with Fine-Tuning
OpenAI has unveiled its advanced o1 reasoning model and the video generation model Sora, enhancing complex reasoning and video creation capabilities. Sora produces high-quality videos using innovative diffusion techniques, while o1 excels in nuanced reasoning and safety. Together, they signal a transformative leap in AI, bridging creativity and rigorous reasoning.
-
DevProxy 0.22 Improves API Permission Checks
Microsoft has released version 0.22 of DevProxy, an API simulation command-line tool. The new version improves logging and detects minimal permissions without the need for Azure API centre.
-
ASP.NET Core 9: Enhancements in Static Asset Handling, Blazor, SignalR, and OpenAPI Support
Microsoft has released .NET 9, which contains features regarding ASP.NET Core 9. This latest release focuses on optimizing static asset handling, refining Blazor's component interaction, enhancing SignalR's observability and performance, and streamlining API documentation through built-in OpenAPI support.
-
Google Debuts OpenAI-compatible API for Gemini
In an effort to make it easier for developers who adopted OpenAI for their LLM-based solutions to switch to Gemini, Google has launched a new endpoint for its Gemini API that allows them to easily switch from one service to the other. The new endpoint is still in beta and provides only partial coverage of OpenAI capabilities.
-
xAI Unveils a New API Service for Grok Models
Elon Musk’s xAI has launched a public beta for its API service, enabling developers to integrate xAI's large language models (LLMs) into their applications.
-
OpenAI Launches Public Beta of Realtime API for Low-Latency Speech Interactions
OpenAI launched the public beta of the Realtime API, offering developers the ability to create low-latency, multimodal voice interactions within their applications. Additionally, audio input/output is now available in the Chat Completions API, expanding options for voice-driven applications. Early feedback highlights limited voice options and response cutoffs.
-
OpenAI Developer Day 2024 (SF) Announces Real-Time API, Vision Fine-Tuning, and More
On October 1, 2024, OpenAI SF DevDay unveiled innovative features, including a Real-Time API enabling instant voice interactions and function calling. Enhanced model distillation and vision fine-tuning empower developers to customize AI for diverse applications. Upcoming events in London and Singapore will further expand these capabilities.