InfoQ Homepage Resilience Content on InfoQ
-
Chaos Architecture
Adrian Cockcroft takes a look at best practices and challenges in getting to a chaos architecture mindset.
-
Chaos Engineering on a Budget
Heather Nakama tells the story of implementing chaos testing on a small product, and how several small and targeted early investments in chaos engineering saved time and effort.
-
Chaos: The Last Stand against Our Robot Overlords
Nathan Äschbacher talks about Chaos Engineering and how to shift towards working with chaos instead of against it, in order to build safe, reliable, and increasingly deterministic complex systems.
-
Expedia’s Journey toward Site Resiliency
Sahar Samiei and Willie Wheeler share Expedia’s resiliency journey, starting with resiliency as an afterthought and progressing toward resiliency as a first-class concern.
-
Designing Services for Resilience: Netflix Lessons
Nora Jones talks about designing microservices for enabling resiliency testing and the moving parts we need to consider when designing them from the get go, and along their lifetime.
-
“GameDay” – Achieving Resilience through Chaos Engineering
Pete Cohen and Matt Fellows discuss GameDay and chaos engineering, what they are, and how they were done successfully by some organizations.
-
Confusion in the Land of the Serverless
Sam Newman introduces serverless computing, discussing how security, resilience, patterns (circuit breaker), vendor lock-in, and microservices are addressed with this technology.
-
State of Chaos Engineering
Bruce Wong discusses the current state of Chaos Engineering, emerging patterns of success, and the future opportunity at hand.
-
Real-Time Metrics Computations to Power Video Quality of Delivery Resilience
Sudhi Nada shows how Conviva deals with video delivery issues, illustrating the business impact of video delivery failures and how to build resiliency into such systems based on real-time metrics.
-
Choose Your Own Adventure: Chaos Engineering
Nora Jones talks about different experiences on "Chaos Adventures" including both successes and failures introducing Chaos in an organization.
-
Drinking from the Elixir Fountain of Resilience
Jearvon Dharrie talks about the factors that contribute to Elixir's perfect match for fault tolerance and resiliency, besides the Open Telecom Platform (OTP).
-
Nonconformist Resilience: DB-backed Job Queues
John Mileham presents how they use, deploy Delayed::Job (a database-backed job queue) at Betterment for its transactional enqueue semantics, safe retry with exponential backoff, and its storage model.