BT
rss
DevOps Follow 483 Followers

Chaos Engineering at Twilio

by Hrishikesh Barua Follow 8 Followers on  Dec 25, 2017

The Twilio team describes their foray into Chaos Engineering where they use Gremlin to inject failures into their homegrown queuing system shards to test for automated recovery.

Cloud Follow 173 Followers

Werner Vogels on “21st Century [Cloud] Architectures”: Availability, Reliability and Resilience

by Daniel Bryant Follow 445 Followers on  Dec 03, 2017

At the AWS re:invent 2017 conference, Werner Vogels, CTO of Amazon, presented a keynote that discussed core concepts required for building “21st Century Architectures” on the cloud. Highlights of the talk included discussion of the emerging practices of evolutionary and “cloud native” architectures, the role of security becoming everyone’s responsibility, and the benefits of chaos engineering.

DevOps Follow 483 Followers

Expedia's Journey toward Site Resiliency: Embracing Chaos Testing in Dev and Production at QCon SF

by Daniel Bryant Follow 445 Followers on  Nov 19, 2017

At QCon SF, Sahar Samiei and Willie Wheeler presented “Expedia’s Journey Toward Site Resiliency”, and discussed the building of a community of practice around resilience testing within Expedia. The results have generally been positive: Netflix’s Chaos Monkey has been running daily in production since May 15th; and resilience tests have been added to four Tier 1 service pipelines.

Architecture & Design Follow 1346 Followers

Adrian Cockcroft Discusses Chaos Architecture: "Four Layers, Two Teams, and an Attitude"

by Daniel Bryant Follow 445 Followers on  Nov 17, 2017 1

At QCon San Francisco, Adrian Cockcroft presented “Chaos Architecture”, and discussed the evolution of cloud native architecture, and how chaos engineering can be applied to produce better and safer systems. Effective chaos architecture and engineering was presented as consisting of “four layers, two teams, and an attitude”.

DevOps Follow 483 Followers

Designing Services for Resilience: Nora Jones Discusses Netflix Chaos Engineering at QCon SF

by Daniel Bryant Follow 445 Followers on  Nov 16, 2017

At QCon SF Nora Jones presented “Designing Services for Resilience Experiments: Lessons from Netflix”. Key takeaways from the talk included: the customer experience is a priority; designing for resiliency testability is a shared responsibility; configuration changes can cause outages; and engineers should have have explicit monitoring in place to detect antipatterns in configuration changes.

DevOps Follow 483 Followers

Choose Your Own Adventure: Chaos Engineering at QCon New York 2017

by Pierre-Luc Maheu Follow 2 Followers on  Aug 22, 2017

Nora Jones, senior chaos engineer at Netflix, talked about chaos engineering at QCon New York 2017. She presents different stages of chaos engineering adoption and gives stories from her previous experiences at Jet and Netflix.

Cloud Follow 173 Followers

Netflix Engineer Lorin Hochstein on Chaos Monkey 2.0

by Rags Srinivas Follow 3 Followers on  Oct 25, 2016

Netflix made waves when it initially announced Chaos Monkey, a tool that would terminate normally healthy VM instances in production. The goal was to embrace failure and thereby increase resiliency. Rags Srinivas caught up with Lorin Hochstein at Netflix regarding the recent upgrade to Chaos Monkey.

DevOps Follow 483 Followers

Chaos Monkey 2.0 Runs via Spinnaker

by Abel Avram Follow 5 Followers on  Oct 24, 2016

Netflix has recently made available the source code of the Chaos Monkey 2.0. The latest iteration of the resilience tool is fully integrated with Spinnaker and event tracking systems, but the SSH support has been removed.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT