InfoQ Homepage Failure Content on InfoQ
-
Building and Trusting a Cloud Bank
Greg Hawkins discusses how Starling Bank, part of the new movement in FinTech challenger banks, is innovating while addressing the need for resilience in a world where failure is everywhere.
-
Freeing the Whale: How to Fail at Scale
Oliver Gould discusses Finagle, a library providing a uniform model for handling failure at the communications layer, enabling Twitter to fail, safely and often.
-
Framing Our Potential for Failure
Michelle Brush discusses modeling complex systems and architectural changes that could introduce new modes of failure, using examples from embedded systems to large stream processing pipelines.
-
Applying Failure Testing Research @Netflix
Kolton Andrus and Peter Alvaro present how a “big idea” -- lineage-driven fault injection -- evolved from a theoretical model into an automated failure testing service at Netflix.
-
Architecting for Failure in a Containerized World
Tom Faulhaber discusses the new container-based toolbox for building systems that are robust in the face of failures, how to recover from failure and how the tools can be used to best effect.
-
Stranger Things: The Forces that Disrupt Netflix
Haley Tucker discusses how other systems may affect Netflix' services, strategies to protect their systems and make sure they won't fail even if things go wrong.
-
0 to 100 days - Running DRTs at Dropbox
Thomissa Comellas shares her experiences developing and rolling out new Disaster Recovery Testing techniques at Dropbox. Tammy Butow shares how her team runs DRTs and has implemented the techniques.
-
Move Faster and Break Even More Things
Marcus Frodin discusses a few failures he has overseen at Spotify, deriving a framework of how to think about and evaluate what worked and what didn’t, and how to get more of the things that did.
-
Solving Business Problems with Data Science
The panelists discuss some of the unique problems that only data science can solve, the pitfalls and the success rate of data science projects.
-
Microservices: State of the Union
Adrian Cockcroft discusses success/failure stories of adopting microservices, overviews what’s next with microservices and presents some of the techniques that have led to successful deployments.
-
A Darwinian Approach to APIs in Government Health
Bruce Haefele shares from the successes and failures implementing an API strategy at Healthdirect Australia.
-
What We Talk about When We Talk about Distributed Systems
Alvaro Videla reviews distributed systems: async/sync, message passing, shared memory, failure detectors, leader election, consensus and different kinds of replication, and recommends related books.