InfoQ Homepage Resilience Content on InfoQ
-
The History of Fire Escapes
Tanya Reilly looks at what can be learned from real world fire codes about expecting failure and designing for it.
-
Serving Millions of Customers Serverless at CapitalOne
Srini Uppalapati, Kiran Satelli talk about how CapitalOne migrated customer accounts and transactions to a completely serverless architecture, and built a resilient Transactions and Accounts platform.
-
Have You Tried Turning It off and on Again?
This talk features examples from the breadth of the SRE discipline to answer questions such as “what characteristics of an operations practice actively influence a system towards greater resiliency?”
-
Unbreakable: Learning to Bend But Not Break at Netflix
Haley Tucker shares examples of chaos experiments which identified problems and built confidence in Netflix’s resilience mechanisms, with challenges, lessons, and benefits scaling chaos engineering.
-
Using Chaos to Build Resilient Systems
Tammy Butow explains how to build resilient systems by focusing on the detection, mitigation, resolution and prevention of incidents.
-
Properties of Chaos
Nathan Aschbacher talks about how and why chaos engineering is being applied to autonomous vehicle safety, how property-based testing principles can influence chaos engineering goals, and more.
-
Heretical Resilience: To Repair is Human
Ryn Daniels describes the “Apache SNAFU”, shares their experiences as the instigator of that snafu and walks through the lessons that can be learned from such an event.
-
Chaos Engineering: Why the World Needs More Resilient Systems
Tammy Butow shares her experiences using chaos engineering to build resilient systems, when they couldn’t build their systems from scratch.
-
Pragmatic Resiliency: Super 6 & Sky Bet Evolution
Michael Maibaum talks about the reality of adapting a complex set of interacting, highly coupled applications to make them more resilient and better able to cope with failure.
-
Incident Management at Netflix Velocity
Dave Hahn talks about how Netflix engineering teams think about failure, why they believe chaos is their friend, failure is guaranteed, and why Netflix is better off having both.
-
Best Practices Building Resilient Systems
Pablo Jensen focuses on best practices and lessons learned in building resilient systems.
-
The Art of Chaos Engineering Panel
The panelists answer audience questions on the emerging field of chaos engineering including what chaos engineering is, how you get started with it, and pitfalls of adoption.