We take a look at Etsy's blameless postmortems in terms of philosophy, process, and practical guidance for avoiding blame and preparing better for the next outage. Because failures are inevitable in complex socio-technical systems, it’s the failure handling and resolution that can be improved by learning from postmortems.
Óscar San José, technical lead at Tuenti (the largest Spanish social network), explains how and why their in-house Flow deployment system allowed developer teams to be more independent and deliver faster.
If you are building or designing your next monitoring system, take a look at this short list of habits exhibited by the most successful monitoring systems in the world today.
This article features highlights from interviews on the state of practice and challenges in the release engineering space. Interview questions cover topics such as metrics and the benefits of continuous delivery.
In this series of articles, you get practical advice from those who have experience helping companies successfully move to cloud environments.
Sriram Narayan’s book, Agile IT Organization Design, provides a basis for reviewing and reshaping the IT organization to equip it better for the digital age.
In this article, the authors examine the enterprise cloud market and technologies and provide guidance for choosing the right cloud solution. They also discuss cloud computing best practices.
The book DevOps in Practice: Reliable and automated software delivery, by Danilo Sato, provides a hands-on approach for implementing continuous delivery and DevOps practices.
This article contains an extensive interview with representatives from Gilt, Hailo and nearForm on the microservices adoption process and the benefits and difficulties of implementing microservices.
Enterprises have continued to accelerate their adoption of cloud infrastructure. As this shift continues, it's important to understand what it means for applications that run in cloud environments.