Distributed Systems Content on InfoQ

News

Architecture & Design

GitHub Reworks Layered Defenses after Legacy Protections Block Legitimate Traffic

GitHub engineers recently traced user reports of unexpected “Too Many Requests” errors to abuse-mitigation rules that had accidentally remained active long after the incidents that prompted them.

Matt Foster
on Feb 04, 2026
Development

Cloudflare Open Sources tokio-quiche, Promising Easier QUIC and HTTP/3 in Rust

Cloudflare has open-sourced tokio-quiche, an asynchronous QUIC and HTTP/3 Rust library that wraps its battle-tested quiche implementation with the Tokio runtime to simplify the development of high-performance QUIC applications. Cloudflare has used the library internally to back its edge services, including the Oxy HTTP proxies and the MASQUE-based tunnels that are replacing the WireGuard-based tunnels in the WARP client.

Olimpiu Pop
on Dec 27, 2025
Architecture & Design

Benchmarking beyond the Application Layer: How Uber Evaluates Infrastructure Changes and Cloud SKUs

Uber’s Ceilometer framework automates infrastructure performance benchmarking beyond applications. It standardizes testing across servers, workloads, and cloud SKUs, helping teams validate changes, identify regressions, and optimize resources. Future plans include AI integration, anomaly detection, and continuous validation.

Leela Kumili
on Dec 26, 2025
Architecture & Design

From On-Demand to Live: Netflix Streaming to 100 Million Devices in under 1 Minute

Netflix’s global live streaming platform powers millions of viewers with cloud-based ingest, custom live origin, Open Connect delivery, and real-time recommendations. This article explores the architecture, low-latency pipelines, adaptive bitrate streaming, and operational monitoring that ensure reliable, scalable, and synchronized live event experiences worldwide.

Leela Kumili
on Dec 05, 2025
Architecture & Design

Stripe's Zero-Downtime Data Movement Platform Migrates Petabytes with Millisecond Traffic Switches

At QCon SF, a Stripe engineer presented the company's Zero-Downtime Data Movement Platform, a system enabling petabyte-scale database migrations with traffic switches that typically complete in milliseconds. The platform supports Stripe's infrastructure, handling 5 million database queries per second while maintaining 99.9995% reliability for $1.4 trillion in annual transactions.

Eran Stiller
on Nov 24, 2025
Architecture & Design

Netflix Tackles Data Deletion at Scale with Centralized Platform Architecture

Netflix engineers presented their architecture for a centralized data-deletion platform at QCon San Francisco, addressing a critical yet rarely discussed system design challenge. The platform manages deletion across heterogeneous data stores while balancing durability, availability, and correctness, processing 76.8 billion row deletions across 1,300 datasets with zero data loss incidents.

Eran Stiller
on Nov 21, 2025
Architecture & Design

QCon SF: Database-Backed Workflow Orchestration Challenges Traditional Architecture

During QCon SF, Jeremy Edberg and Qian Li from DBOS presented an unconventional architectural approach to workflow orchestration that treats PostgreSQL not just as a data store but as the orchestration layer itself. Their talk addressed a persistent problem in distributed systems: workflows frequently fail, recovery mechanisms are complex, and visibility into workflow state remains challenging.

Eran Stiller
on Nov 18, 2025
AI, ML & Data Engineering

QCon London 2026 Announces Tracks: AI Engineering, Building Teams, Tech of Finance, and More

The QCon London 2026 tracks are live: 15 practitioner-curated deep dives on AI adoption, resilient architectures, distributed systems, performance, modern languages, data, security, and Staff+ leadership, rooted in real production lessons.

Artenisa Chatziou
on Nov 03, 2025
Architecture & Design

Airbnb’s Mussel V2: Next-Gen Key Value Storage to Unify Streaming and Bulk Ingestion

Airbnb’s engineering team re-architected its internal key-value storage system, Mussel, to unify streaming and bulk ingestion and to simplify operations. The new version achieves over 100,000 writes per second and sub-25ms read latencies on 100-terabyte tables, and uses Kubernetes, Kafka, and a NewSQL backend to improve scalability, reliability, and operational efficiency across Airbnb's internal services.

Leela Kumili
on Oct 24, 2025
Architecture & Design

How LinkedIn Built Enterprise Multi-Agent AI on Existing Messaging Infrastructure

LinkedIn extended its generative AI application platform to support multi-agent systems by repurposing its existing messaging infrastructure as an orchestration layer. This allowed the company to scale AI agents without building new coordination technology from scratch and achieve global availability while supporting complex multi-step workflows through agent coordination.

Eran Stiller
on Sep 15, 2025
Architecture & Design

LinkedIn Re-Architects Edge-Building System to Support Diverse Inference Workflows

LinkedIn has detailed its re-architected edge-building system, an evolution designed to support diverse inference workflows for delivering fresher and more personalized recommendations to members worldwide. The new architecture addresses growing demands for real-time scalability, cost efficiency, and flexibility across its global platform.

Leela Kumili
on Sep 02, 2025
Cloud

Amazon SQS Fair Queues: a New Approach to Multi-Tenant Resiliency

AWS's new Fair Queues feature for Amazon SQS mitigates the "noisy neighbor" problem in multi-tenant systems. It keeps message dwell times low for quieter tenants without requiring code changes, improving both performance and fairness. Developers can enable the capability to maintain consistent service quality across applications.

Steef-Jan Wiggers
on Jul 31, 2025
Cloud

Microsoft Azure Enhances Observability with OpenTelemetry Support for Logic Apps and Functions

Microsoft has expanded OpenTelemetry support in Azure Logic Apps and Functions, improving observability and interoperability across platforms. The open-source framework enables telemetry data generation and correlation, extending diagnostics beyond standard telemetry. With streamlined configuration and integration, Azure's offerings aim for standardized observability across cloud services.

Steef-Jan Wiggers
on Jun 23, 2025
Cloud

Temporal on AWS Aims to Ease Building Resilient Distributed Systems

Temporal Technologies, the company that created Temporal, an open-source microservices orchestration platform focused on durable execution, has made Temporal Cloud available on the AWS Marketplace. By offering its services via AWS, the company aims to simplify the development of resilient distributed systems for large-scale applications.

Steef-Jan Wiggers
on May 09, 2025
Architecture & Design

Mezzalira at QCon London: Micro-Frontends from Design to Organisational Benefits and Deployments

During his QCon London presentation, Luca Mezzalira, principal architect at AWS, shared his experience in building micro frontend platforms. He outlined how to determine whether micro frontends are right for a company, the core principles for designing an architecture that fits the use case, and deployment strategies for distributed architectures.

Olimpiu Pop
on Apr 30, 2025
