Newer rss

Architecting for Failure at the

Posted by Michael Brunton-Spall  on  Apr 25, 2012

Michael Brunton-Spall talks about various types of system failure that can happen, sharing the lessons learned at the Guardian and measures taken to prevent and mitigate failure.

Resilient Response In Complex Systems

Posted by John Allspaw  on  Apr 19, 2012

John Allspaw discusses pitfalls to be avoided while troubleshooting failed systems, comparing web operations at scale with practices in aviation and nuclear power industries.

On Distributed Failures (and handling them with Doozer)

Posted by Blake Mizerany  on  Dec 27, 2011 1

Blake Mizerany presents various ways that can lead to system failure in distributed systems and how to recover using Doozer, a highly available, consistent data store.

Things Break, Riak Bends

Posted by Justin Sheehy  on  Aug 09, 2011

Justin Sheehy talks about failure and the need to prepare for it, giving some real life examples along with techniques implemented in Riak to make it resilient to faults.

Everything I've Ever Learned, I Learned from Failure

Posted by Robert Myers  on  Apr 07, 2011 1

Robert Myers talks about the role played by failure in Agile development, sharing a number of Lean and Agile practices helping to embrace failure and showing how to interpret the feedback received.

Failures and Successes with Reuse

Posted by Herbjörn Wilhelmsen  on  Mar 23, 2011 4

Herbjörn Wilhelmsen discusses the reasons why an SOA project failed while trying to reuse existing resources, and how it succeeded later starting from the same business case with reuse in mind.

Embracing Concurrency At Scale

Posted by Justin Sheehy  on  Jun 23, 2010

Justin Sheehy explains the principles behind concurrent distributed systems: no global state, no ACID but rather BASE, no RPC but protocols over APIs, prepare for failure, degradation, measurement.

Failure Comes in Flavors - Stability Anti-patterns

Posted by Michael Nygard  on  Sep 15, 2009 4

Michael Nygard encourages us to have a failure oriented mindset. He presents many anti-patterns leading to systems instability and failure, accompanied by design patterns that should be used instead.

10 Ways to Screw Up with Scrum and XP

Posted by Henrik Kniberg  on  Aug 20, 2008 4

Henrik Kniberg talks about 10 possible reasons to fail while doing Scrum and XP. Maybe the team does not have a definition of what Done means to them, or they don't know what their velocity is.

"We Suck Less!" Is Not Enough

Posted by David Douglas & Robin Dymond  on  Aug 15, 2008 4

David Douglas and Robin Dymond discuss about companies adopting Agile, but don't go all the way, resulting in failure and rejection of it, and predictably having a negative impact on Agile's future.

General Feedback
Editorial and all content copyright © 2006-2015 C4Media Inc. hosted at Contegix, the best ISP we've ever worked with.
Privacy policy