We take a look at Etsy's blameless postmortems, both in terms of philosophy, process and practical measures/guidance to avoid blame and better prepare for the next outage. Because failures are inevitable in complex socio-technical systems, it’s the failure handling and resolution that can be improved by learning from postmortems.
Storm Applied is a new book from Manning that aims to provide a practical guide on using Storm, both in a development and in a production setting. InfoQ has spoken with two of the book’s authors.
If you are building or designing your next monitoring system, take a look at this short list of habits exhibited by the most successful monitoring systems in the world today. 1
This article features highlights from interviews on the state of practice and challenges in release engineering space. Interview questions cover topics like metrics, continuous delivery's benefits. 1
In this series of articles, you get practical advice from those who have experience helping companies successfully move to cloud environments.
Sriram Narayan’s book – Agile IT Organization Design, provides a basis for reviewing and reshaping the IT organization to equip it better for the digital age.
JGroups has many features useful to a Raft consensus based implementation. In this article, Ugo Landini takes us through a project to implement a Raft consensus based algorithm on top of JGroups.
The book Devops in Practice: Reliable and automated software delivery by Danilo Sato provides a hands-on approach for implementing continuous delivery and DevOps practices.
This article contains an extensive interview on the microservices adoption process, the benefits and difficulties of implementing microservices, with representatives from Gilt, Hailo and nearForm. 5
An interview with Google's William Vambenepe, who's lead product manager for Big Data services, to ask him about the shift from products to services when working with Big Data.