Architecture & Design Content on InfoQ
-
OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing
OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases.
-
Honeycomb Hosted MCP Brings Observability Data into the IDE
Honeycomb has launched its hosted Model Context Protocol (MCP) server, giving developers real-time access to observability data inside IDEs and AI tools like GitHub Copilot. Available as a managed service on AWS Marketplace, it removes the need for self-hosting and streamlines debugging by surfacing traces, metrics, and logs without context-switching.
-
Uber Shares Strategy for Controlling Risk in Monorepo Changes That Affect 3,000+ Microservices
Uber has published details on its approach to controlling rollouts of large-scale changes across monorepos that serve thousands of microservices, addressing one of the key challenges in continuous deployment at massive scale.
-
Impulse, Airbnb’s New Framework for Context-Aware Load Testing
Airbnb has developed Impulse, an internal load testing framework to improve microservice reliability and performance. It enables distributed, large-scale testing and lets teams run self-service, context-aware load tests integrated with CI pipelines. By simulating production-like traffic, Impulse helps engineers identify bottlenecks and errors before changes reach production.
-
Java News Roundup: OpenJDK, TornadoVM, Payara Platform, Apache Kafka, Grails, Micronaut
This week's Java roundup for September 1st, 2025, features news highlighting: JEP 517 proposed to target for JDK 26; TornadoVM releases GPULlama3.java 0.2.0; the September 2025 edition of the Payara Platform; point releases of Quarkus, Micronaut, Apache Kafka and Apache Tomcat; and second release candidates of Grails 7.0 and Gradle 9.1.
-
Google Spanner Unifies OLTP and OLAP with Columnar Engine
Google Spanner now features a columnar engine, allowing its distributed database to handle both OLTP and OLAP workloads on a single platform. This hybrid architecture eliminates the need for separate data warehouses and ETL pipelines. The engine's columnar storage and vectorized execution accelerate analytical queries up to 200x on live data, which is especially beneficial for AI applications.
-
LinkedIn Re-Architects Edge-Building System to Support Diverse Inference Workflows
LinkedIn has detailed its re-architected edge-building system, an evolution designed to support diverse inference workflows for delivering fresher and more personalized recommendations to members worldwide. The new architecture addresses growing demands for real-time scalability, cost efficiency, and flexibility across its global platform.
-
Agentic AI Expands into SecOps to Ease Human Workloads
Agentic AI is beginning to reshape malware detection and broader security operations. These systems are being used not to replace humans but to take on the lower-value jobs that have historically tied up analysts, from triaging alerts to reverse-engineering suspicious files.
-
Java News Roundup: Groovy 5.0, Project Leyden, Helidon MCP, JReleaser, LangChain4j, Open Liberty
This week's Java roundup for August 25th, 2025, features news highlighting: the GA release of Apache Groovy 5.0; a new early-access build of Project Leyden; the introduction of the Helidon MCP server; point releases of JReleaser, LangChain4j, Quarkus and Camel Quarkus; the beta release of Open Liberty 25.0.0.9; and the first alpha release of Hibernate Validator 9.1.0.
-
Uber Eats Scales Catalog Management from Restaurants to Retail with INCA Framework
Uber Eats introduced INCA (Inventory and Catalog), a scalable system to handle vast product catalogs from supermarkets, pharmacies, and retail partners. Unlike the earlier restaurant-focused setup, which was built for low SKU counts and simple pass-through data, INCA supports large-scale inventories, rich metadata, and the compliance needs essential for retail operations.
-
AWS Launches Memory-Optimized EC2 R8i and R8i-flex Instances with Custom Intel Xeon 6 Processors
AWS has launched its eighth-generation Amazon EC2 R8i and R8i-flex instances, powered by custom Intel Xeon 6 processors. Designed for memory-intensive workloads, these instances offer up to 15% better price performance and enhanced memory throughput, making them ideal for real-time data processing and AI applications.
-
CNCF Incubates OpenYurt for Kubernetes at the Edge
OpenYurt, a project incubated by the Cloud Native Computing Foundation (CNCF), extends Kubernetes to edge computing, enhancing performance in locations like IoT sites and branch offices. With a growing community and support from industry leaders, OpenYurt focuses on efficient cluster management while maintaining Kubernetes compatibility, ensuring lower latency and improved reliability.
-
AWS Lambda Response Streaming Increases Payload Limit to 200 MB
AWS has increased the Lambda response streaming payload limit from 20 MB to 200 MB. The higher limit lets developers stream larger data sets while keeping the Time to First Byte benefit of streaming. It also simplifies response handling by removing the need for complex workarounds when a function returns a large payload.
-
Java News Roundup: JDK 25-RC2, Eclipse JNoSQL, Quarkus, Apache TomEE, Apache Camel
This week's Java roundup for August 18th, 2025, features news highlighting: the second release candidate of JDK 25; a point release of Apache Camel; maintenance releases of Eclipse JNoSQL, Quarkus and Apache TomEE; and the Quarkus team implementing the resolution to CVE-2025-55163.
-
Instacart Consolidates Search Infrastructure on PostgreSQL, Phasing out Elasticsearch
Instacart has redesigned its search infrastructure by replacing Elasticsearch with PostgreSQL, combining lexical and embedding-based retrieval in a single database engine. The change was driven by the cost and complexity of maintaining separate systems for search and transactional workloads. The company aimed to simplify operations, reduce overhead, and improve precision in search results.