BT

Failure Injection Testing: Controlling Failure in Production

by Michael Stiefel on  Dec 12, 2014 2

Netflix's Failure Injection Testing bridges the gap between isolated testing and unmitigated chaos testing by controlling the impact of the test. FIT establishes a context which other components of Netflix's production testing and infrastructure systems interpret and adjust the behavior of the system accordingly.

Mindfulness and Situational Awareness in Organizations

by Ben Linders on  Nov 12, 2014 1

To thoroughly remove waste in a process you need flow to deliver just in time, and mindfulness and situational awareness in organizations to handle problems with processes and built in human intelligence. Organizations apply concepts from flow to develop what is needed and when it is needed and use pull to prevent inventories. What they also need is “Jidoka”: mindfulness and situational awareness.

How Netflix Handled the Reboot of 218 Cassandra Nodes

by Abel Avram on  Oct 28, 2014

Amazon performed a major maintenance update at the end of September in order to patch a security vulnerability in a Xen hypervisor affecting about 10% of their global fleet of cloud servers. This update involved the rebooting of those servers, with consequences for AWS users and the services they provide, including one of their largest clients, Netflix.

Avoidance of Organizational Dysfunction Leads to Scrum Masters' Failure

by Savita Pahuja on  Oct 17, 2014 2

Bob Marshall explains the reason of failing of scrum master in most of the organizations as the lack of awareness on the part of adopting scrum and scrum master’s responsibility to tackle organizational dysfunction.

Leslie Lamport on Distributed Systems and Precise Thinking

by Sergio De Simone on  Oct 16, 2014

Leslie Lamport is the author of some of the most cited computer science papers and won a Turing Award in 2013 for his seminal work in distributed and concurrent systems. This is a summary of an interview that Lamport gave to Software Engineering Radio touching themes such as his early work in distributed systems and the importance of precise thinking in programming.

Fail Fast Means Learn Fast

by Rui Miguel Ferreira on  Jul 04, 2014

Failing fast and often is one of the encouraged practices for agile teams. Sander Hoogendoorn, author of the This is Agile book discusses on his blog the importance of having a strategy that helps you on the decision of aborting a project by assuming its failure on an early stage.

Working with Investors as a Lean Startup

by Ben Linders on  Mar 13, 2014

Entrepreneurs using lean startup can work with investors to raise capital for their business. Business plans from lean startups often differ from traditional startups and lean startup encourages learning from failure and to pivot, which might scare off investors. Can entrepreneurs and investors together use the lean startup approach to do fundraising?

Attitudes for Sustainable Lean Startup Teams

by Ben Linders on  Mar 11, 2014

Ramli John gave an ignite talk about the minimum viable attitudes for lean startup teams at the 2013 lean startup conference. According to Ramli there are three attitudes that help teams to run lean sustainable over time: humbleness, hunger and happiness.

How Can You Learn Early and Fast?

by Ben Linders on  Dec 26, 2013

Agile suggest that teams should fail-fast to enable quick learning from mistakes. Learning from failure is one approach, you can also learn early and fast from successes, by doing experimentation, or by using a plan for knowledge acquisition.

Testing Resiliency at PagerDuty Without a Simian Army

by Manuel Pais on  Nov 12, 2013

Doug Barth, from PagerDuty, talked at DevOps Days London about their approach to start resiliency testing their systems without dedicating a lot of automation effort upfront. The goal was to quickly start learning about failure points and openly discuss how to fix them with only one hour per week of effort.

Learning from Failures with The Lean Startup

by Ben Linders on  May 02, 2013 2

The lean startup is about fast delivery of desired products to customers, and increasing your understanding about the needs of customers. With the lean startup, people can learn faster from failures and become better innovators. There are teachers that use a lean startup based approach in education, which helps their students to learn faster.

Avoiding Downtime When Cloud Services Fail

by Abel Avram on  Jul 03, 2012

Another AWS outage hit several large websites and their services last week. What can be done to avoid downtime? Architect for failover not just for scale.

Adopting Agile in an Environment of Fear

by Amr Elssamadisy on  Jul 25, 2011 3

Agile adoption and transformation is sometimes effective, and sometimes not. Is there a common thread to the failures? Does fear have anything to do with it? And what can we expect if we start an agile adoption initiative in an environment that is full of fear?

All Right It Failed, What Next?

by Vikas Hazrati on  Jun 29, 2011 2

Usually failures result in anger, frustration and playing the blame game. However, failures are wasted if there is no learning from them. How can Agile teams make failures beautiful?

Commercial Interests Censoring Failures

by Amr Elssamadisy on  Jun 20, 2011 2

Philippe Kruchten described the Agile movement as "The agile movement is in some ways a bit like a teenager: very self-conscious, checking constantly its appearance in a mirror, accepting few criticisms..." and shared a list of twenty elephants in the room - uncomfortable issues that are ignored on purpose. The first of these unmentionables is that commercial interests are censoring failures.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT