• Architecture & Design

    Resilient Systems in Banking

    by Greg Hawkins on Oct 06, 2018

    Resilience is about tolerating failure, not eliminating it. To build a resilient system, you must build a system that absorbs shocks, and continues or recovers. Following best practices for resilient architecture, including established cloud patterns, allowed Starling Bank to build a bank, from scratch, in a year, against a backdrop of highly public outages amongst incumbent banks.

  • DevOps

    Chaos Conf Q&A: The Benefits, Challenges and Practices of Chaos Engineering

    by Daniel Bryant on Sep 14, 2018

    This Q&A, published ahead of the Chaos Conf event running in San Francisco in September, examines the benefits and challenges of chaos engineering. The article also outlines emerging good practice, along with prerequisites, recommendations, and tips for getting started.

  • DevOps

    DevOps and Cloud InfoQ Trends Report - January 2018

    by Daniel Bryant, Manuel Pais, Steffen Opel, Richard Seroter, Chris Swan on Jan 31, 2018

    This article, following on from the Culture and Methods piece we published last week, provides a summary of how we currently see the operations space, which for us is mainly DevOps and cloud.

DevOps

Chaos Engineering

Posted by Niosha Behnam, Luke Kosewski, Justin Reynolds, Casey Rosenthal, Ali Basiri, Ruud de Rooij, Lorin Hochstein on Jan 08, 2017

Many large tech organizations use experimentation to verify the reliability of their distributed systems. Netflix engineers have identified several principles underlying this approach, called chaos engineering, and applied them to run experiments.


Book Review and Interview: The Practice of Cloud System Administration

Posted by Richard Seroter on Dec 18, 2014

The new book, The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems, looks at a wide range of considerations for cloud-scale systems.


Jonas Bonér on Reactive Systems Anti-Patterns

Posted by Sergio De Simone on Oct 20, 2014

Jonas Bonér, TypeSafe CTO and original author of the first Reactive Manifesto, offered his thoughts about both desirable features of reactive applications and what is not reactive programming.


Russ Miles on Antifragility and Microservices

Posted by Ralph Winzinger on May 13, 2014

Antifragility and microservices are currently trending topics, which may hint that new architectural paradigms or design patterns for building application systems are on their way.


Interview with Raffi Krikorian on Twitter's Infrastructure

Posted by Xuefeng Ding on Jan 19, 2014

Raffi Krikorian, Vice President of Platform Engineering at Twitter, gives insight into how Twitter prepares for unexpected traffic peaks and how its system architecture is designed to cope with failure.