Architecture & Design Content on InfoQ
-
The Rise of the Serverless Data Architectures
Gwen Shapira explores the implications of serverless workloads on the design of data stores, and the evolution of data architectures toward more flexible scalability.
-
From Open Source to SaaS: the Journey of ClickHouse
Sichen Zhao and Shane Andrade discuss architectural design decisions and some of the pitfalls one may run into along the way.
-
Reliable Architectures through Observability
Kent Quirk gives an overview of observability tools and techniques, with specific recommendations for fitting observability into system designs and day-to-day development processes.
-
Banking on Thousands of Microservices
Suhail Patel covers lessons learned creating a banking platform on the cloud that serves over 7 million customers daily and relies on a lean engineering team, microservices, Cassandra, and Kubernetes.
-
How to Build a Reliable Kafka Data Processing Pipeline, Focusing on Contention, Uptime and Latency
Lily Mara shares how OneSignal improved the performance and maintainability of its highest-throughput HTTP endpoints (backed by a Kafka consumer in Rust) by making the system asynchronous.
-
Deconstructing an Abstraction to Reconstruct an Outage
Chris Sinjakli explores the aftermath of a complex outage in a Postgres cluster, retracing the steps taken to reliably reproduce the failure in a local environment.
-
Hard Problems in Front-End Platforms
Katie Sylor-Miller discusses the world of Front-end Platform Engineering, exploring the unique challenges, strategies, and best practices involved in creating robust, scalable, and reliable systems.
-
Eventual Consistency – Don’t Be Afraid!
Susanne Braun shares her experience from case studies with industry clients, along with open-access design guidelines developed using action research.
-
Architecting a Production Development Environment for Reliability
At Meta, developers use servers (devservers) to perform their daily work. This talk discusses the devservers' software architecture and the mechanisms employed to ensure they remain reliable and available.
-
Using Traffic Modeling to Load-Balance Netflix Traffic at Global Scale
Niosha Behnam and Sergey Fedorov discuss how Netflix shifted from geo-based DNS load-balancing to a latency-based approach, relying on real-user measurements and building a model of Netflix traffic.
-
The Joy of Building Large Scale Systems
Suhail Patel discusses the art and practice of building systems from core principles with a focus on how this can be done in practice within teams and organisations.
-
Performance: Adventures in Thread-per-Core Async with Redpanda and Seastar
John Spray describes an experience of building high performance systems with C++20 in an asynchronous runtime, and explores the challenges & tradeoffs in adopting a thread-per-core architecture.