InfoQ Homepage Architecture & Design Content on InfoQ
-
Java News Roundup: Spring Shell, JReleaser, TornadoInsight, Apache Camel
This week's relatively quiet Java roundup for December 29th, 2025, features news highlighting: the GA release of Spring Shell 4.0; point releases of JReleaser 1.22.0 and Apache Camel 4.14.3; and TornadoInsight now compatible with the recent release of TornadoVM 2.0.
-
Swiggy Rolls out Hermes V3: from Text-to-SQL to Conversational AI
Swiggy has released Hermes V3, a GenAI-powered text-to-SQL assistant that enables employees to query data in plain English. The Slack-native system combines vector retrieval, conversational memory, agentic orchestration, and explainability to improve SQL accuracy and support multi-turn analytical queries.
-
Amazon S3 Vectors Reaches GA, Introducing "Storage-First" Architecture for RAG
AWS has announced the general availability of Amazon S3 Vectors, increasing per-index capacity forty-fold to 2 billion vectors. By natively integrating vector search into the S3 storage engine, the service introduces a "Storage-First" architecture that decouples compute from storage, reducing total cost of ownership by up to 90% for large-scale RAG workloads.
-
Kubernetes 1.35 Released with In-Place Pod Resize and AI-Optimized Scheduling
The Cloud Native Computing Foundation (CNCF) announced the release of Kubernetes 1.35, named "Timbernetes", emphasizing its focus on mutability and the optimization of high-performance AI/ML workloads.
-
Architectural Lessons From Patreon's Year in Review
In 2025, Patreon’s engineering team expertly balanced feature delivery for 10M+ members with vital infrastructure upgrades. Their Year in Review highlights 12 projects focused on maintenance and evolution, emphasizing resilient migration, data model refactoring, and strategic consistency trade-offs, ultimately redefining backend operations while enhancing system reliability and performance.
-
Microsoft Foundry Agent Service Simplifies State Management with Long-Term Memory Preview
Microsoft has launched a public preview of a managed long-term memory store for its Foundry Agent Service. The service automates the extraction, consolidation, and retrieval of user context, providing a native "state layer" that prevents intelligence decay in long-running interactions with AI agents.
-
CNCF Launches Certified Kubernetes AI Conformance Program to Standardise Workloads
The CNCF has launched the Certified Kubernetes AI Conformance program to standardise artificial intelligence workloads. By establishing a technical baseline for GPU management, networking, and gang scheduling, the initiative ensures portability across cloud providers. It aims to reduce technical debt and prevent vendor lock-in as enterprises move generative AI models into production.
-
AWS Expands Well‑Architected Guidance with Data Residency and Hybrid Cloud Lens
Earlier this year, AWS launched the Well-Architected Data Residency with Hybrid Cloud Services Lens, providing guidance for hybrid cloud workloads. The lens covers data classification, operational practices, automation, and compliance, helping organizations manage data location while optimizing security, cost, and resilience.
-
Java News Roundup: Spring Vault, LangChain4j, Seed4J, Infinispan, Gradle
This week's Java roundup for December 22nd, 2025, features news highlighting: new interfaces, VaultClient and ReactiveVaultClient, in Spring Vault; point releases for LangChain4j and Seed4J; maintenance releases for Micronaut, Quarkus and Infinispan; and the second release candidate of Gradle 9.3.
-
How Authress Designed for Resilience and Survived a Major AWS Outage
Identity and authentication services company Authress shared its strategy to stay operational during major cloud infrastructure outages like the massive October 2025 AWS outage that disrupted many major services. According to Authress CTO Warren Parad, the company's resilience architecture relies on strategies like multi-region deployment and minimizing reliance on AWS control plane services.
-
AWS Launches Network Firewall Proxy in Preview to Simplify Managed Egress Security
AWS has unveiled the preview of its Network Firewall proxy, a managed service that optimizes proxy management and enhances outbound security for VPCs. Integrated with NAT Gateway, this tool inspects traffic through a three-phase model and supports both TLS interception and centralized models via Transit Gateway. Currently available in East Ohio.
-
Cloudflare Open Sources tokio‑quiche, Promising Easier QUIC and HTTP/3 in Rust
Cloudflare has open-sourced tokio-quiche, an asynchronous QUIC and HTTP/3 Rust library that wraps its battle-tested quiche implementation with the Tokio runtime to simplify the development of high-performance QUIC applications. The library was used internally to back the edge services, the Oxy HTTP proxies or MASQUE-based tunnels replacing the Wireguard-based tunnels in the WARP client.
-
Uber Adopts Amazon OpenSearch for Semantic Search to Better Capture User Intent
To improve search and recommendation user experiences, Uber migrated from Apache Lucene to Amazon OpenSearch to support large-scale vector search and better capture search intent. This transition introduced several infrastructure challenges, which Uber engineers addressed with targeted solutions.
-
Benchmarking beyond the Application Layer: How Uber Evaluates Infrastructure Changes and Cloud Skus
Uber’s Ceilometer framework automates infrastructure performance benchmarking beyond applications. It standardizes testing across servers, workloads, and cloud SKUs, helping teams validate changes, identify regressions, and optimize resources. Future plans include AI integration, anomaly detection, and continuous validation.
-
Beyond Win Rates: How Spotify Quantifies Learning in Product Experiments
Spotify has introduced the Experiments with Learning (EwL) metric on top of its Confidence experimentation platform to measure how many tests deliver decision-ready insights, not just how many “win.” EwL captures both the quantity and quality of learning across product teams, helping them make faster, smarter product decisions at scale. The outcome must support one action: ship, abort, or iterate.