InfoQ Homepage Architecture & Design Content on InfoQ
-
Pitfalls and Patterns in Microservice Dependency Management
Silvia Esparrachiari shares stories on how a small change can impact a system, discussing the importance of having a broad view of a system to better understand how a change can impact a system.
-
Managing the Risk of Cascading Failure
Laura Nolan discusses some of the mechanisms that cause cascading failures, what can be done to reduce the risk, and what to do if there is a cascading failure situation.
-
User Adaptive Security
Christina Camilleri and Jesse Kriss discuss how Netflix has readjusted their investments around user-focused security, and explore strategies towards a tiered access approach within endpoint security.
-
Essential Complexity in Systems Architecture
Laura Nolan looks at some real distributed system architectures and examines the tradeoffs made, showing how simple systems can create complex and difficult to understand behaviors.
-
Pragmatic Performance - Tales from the Trenches
Ramesh Subramonian shares techniques used to improve the performance of an existing system.
-
The World Is on Fire and so Is Your Website
Ann Lewis discusses how MoveOn architects and scales an ecosystem of custom tools that power political organizing work like rapid response mobilizations, vote programs, and data-driven campaigns.
-
How to Tame Your Service APIs: Evolving Airbnb’s Architecture
Jessica Tai discusses the challenges scaling to hundreds of services, how to simplify APIs, the trade-offs in API design, and how to test and operate flexible aggregator APIs and service blocks.
-
Rebuilding Twitter’s Public API
Steve Cosenza discusses why Twitter's multi-tenant API platform was built with Scala, GraphQL, and how Twitter uses SLOs for monitoring and alerting in production.
-
Serverless Search for My Blog with Java, Quarkus & AWS Lambda
Gunnar Morling discusses Quarkus, cold starts and serverless for Java.
-
Using DevEx to Accelerate GraphQL Federation Adoption @Netflix
Paul Bakker and Kavitha Srinivasan discuss how they made certain Build vs Buy (open source) trade-offs and the socio-technical aspects of working with many teams on a single shared schema.
-
User Simulation for Rapid Outage Mitigation
Carissa Blossom walks through the monitoring service that Uber developed to identify issues in production, and how they leveraged composable integration tests to cut the time to mitigation in half.
-
Change Data Capture for Distributed Databases @Netflix
Raghuram Onti Srinivasan covers the challenges associated with capturing CDC events from Cassandra, discussing the Flink ecosystem and the use of RocksDB.