Breaking Bad at Netflix: Building Failure as a Service
Summary
Kolton Andrus explains how Netflix, to harden its systems, designed “Failure as a Service” so that anyone can test and validate how their systems handle failure.
Bio
Kolton Andrus (@deelyle) is a Chaos Engineer on Netflix’s Edge Platform team. He designed and built FIT, a failure injection service. Prior to Netflix, he worked at Amazon Retail, where he built Gremlin, Amazon’s failure service. At both companies he has served as a ‘Call Leader’, managing the resolution of large-scale incidents.
About the conference
Software is Changing the World. QCon empowers software development by facilitating the spread of knowledge and innovation in the developer community. A practitioner-driven conference, QCon is designed for technical team leads, architects, engineering directors, and project managers who influence innovation in their teams.