InfoQ Homepage Architecture & Design Content on InfoQ
-
How LinkedIn Built Enterprise Multi-Agent AI on Existing Messaging Infrastructure
LinkedIn extended its generative AI application platform to support multi-agent systems by repurposing its existing messaging infrastructure as an orchestration layer. This allowed the company to scale AI agents without building new coordination technology from scratch and achieve global availability while supporting complex multi-step workflows through agent coordination.
-
Java News Roundup: OpenJDK JEPs, TornadoVM, Spring Framework, Open Liberty, JBang
This week's Java roundup for September 8th, 2025, features news highlighting: OpenJDK JEPs targeted for JDK 26 and new candidates; first integration of GPULlama3.java with LangChain4j; milestone releases of Spring Framework, Spring Data and Spring AI; Spring Authorization Server moving to Spring Security; the September 2025 edition of Open Liberty; and a point release of JBang.
-
Azure Service Groups Enter Public Preview Offering New Abstraction Layer for Resource Management
Microsoft has launched Azure Service Groups in public preview, a new feature designed to simplify resource management and administration. Acting as a flexible, tenant-level container, Service Groups allow users to organize Azure resources from anywhere within their tenant without affecting RBAC or policy inheritance.
-
OpenAI’s gpt-realtime Enables Production-Ready Voice Agents with End-to-End Speech Processing
OpenAI launched gpt-realtime and the Realtime API, enabling production-ready AI voice agents with end-to-end speech processing, lower latency, and natural speech delivery. New features include SIP phone support, image input, MCP server integration, and improved safeguards. Early adopters like Zillow and T-Mobile are testing real-time customer service and search use cases.
-
Honeycomb Hosted MCP Brings Observability Data into the IDE
Honeycomb has launched its hosted Model Context Protocol (MCP), giving developers real-time access to observability data inside IDEs and AI tools like GitHub Copilot. Available as a managed service on AWS Marketplace, it removes the need for self-hosting and streamlines debugging by surfacing traces, metrics, and logs without context-switching.
-
Uber Shares Strategy for Controlling Risk in Monorepo Changes That Affect 3,000+ Microservices
Uber has published details on their approach to controlling rollouts of large-scale changes across monorepos that serve thousands of microservices, addressing one of the key challenges in continuous deployment at massive scale.
-
Impulse, Airbnb’s New Framework for Context-Aware Load Testing
Airbnb has developed Impulse, an internal load testing framework to improve microservice reliability and performance. It enables distributed, large-scale testing and lets teams run self-service, context-aware load tests integrated with CI pipelines. By simulating production-like traffic, Impulse helps engineers identify bottlenecks and errors before changes reach production.
-
Java News Roundup: OpenJDK, TornadoVM, Payara Platform, Apache Kafka, Grails, Micronaut
This week's Java roundup for September 1st, 2025, features news highlighting: JEP 517 proposed to target for JDK 26; TornadoVM releases GPULlama3.java 0.2.0; the September 2025 edition of the Payara Platform; point releases of Quarkus, Micronaut, Apache Kafka and Apache Tomcat; and second release candidates of Grails 7.0 and Gradle 9.1.
-
Google Spanner Unifies OLTP and OLAP with Columnar Engine
Google Spanner now features a columnar engine, allowing its distributed database to handle both OLTP and OLAP workloads on a single platform. This hybrid architecture eliminates the need for separate data warehouses and ETL pipelines. The engine's columnar storage and vectorized execution accelerate analytical queries up to 200x on live data, which is especially beneficial for AI applications.
-
LinkedIn Re-Architects Edge-Building System to Support Diverse Inference Workflows
LinkedIn has detailed its re-architected edge-building system, an evolution designed to support diverse inference workflows for delivering fresher and more personalized recommendations to members worldwide. The new architecture addresses growing demands for real-time scalability, cost efficiency, and flexibility across its global platform.
-
Agentic AI Expands into SecOps to Ease Human Workloads
Agentic AI is beginning to reshape malware detection and broader security operations. These systems are being used not to replace humans, but to take on the lower value jobs that have historically tied up analysts — from triaging alerts to reverse-engineering suspicious files.
-
Java News Roundup: Groovy 5.0, Project Leyden, Helidon MCP, JReleaser, LangChain4j, Open Liberty
This week's Java roundup for August 25th, 2025, features news highlighting: the GA release of Apache Groovy; a new early-access build of Project Leyden; introducing the Helidon MCP server; point releases of JReleaser, LangChain4j, Quarkus, Camel Quarkus; the beta release of Open Liberty 25.0.0.9; and the first alpha release of Hibernate Validator 9.1.0.
-
Uber Eats Scales Catalog Management from Restaurants to Retail with INCA Framework
Uber Eats introduced INCA (Inventory and Catalog), a scalable system to handle vast product catalogs from supermarkets, pharmacies, and retail partners. Unlike the earlier restaurant-focused setup built for low SKUs and simple pass-through data, INCA supports large-scale inventories, rich metadata, and compliance needs essential for retail operations.
-
AWS Launches Memory-Optimized EC2 R8i and R8i-flex Instances with Custom Intel Xeon 6 Processors
AWS has launched its eighth-generation Amazon EC2 R8i and R8i-flex instances, powered by custom Intel Xeon 6 processors. Designed for memory-intensive workloads, these instances offer up to 15% better price performance and enhanced memory throughput, making them ideal for real-time data processing and AI applications.
-
CNCF Incubates OpenYurt for Kubernetes at the Edge
OpenYurt, a project incubated by the Cloud Native Computing Foundation (CNCF), extends Kubernetes to edge computing, enhancing performance in locations like IoT sites and branch offices. With a growing community and support from industry leaders, OpenYurt focuses on efficient cluster management while maintaining Kubernetes compatibility, ensuring lower latency and improved reliability.