BT
rss
Culture & Methods Follow 798 Followers

Atlassian Announces Solutions for Incident Management

by Ben Linders Follow 28 Followers on  Sep 20, 2018

Atlassian announced on September 4 that they have launched a new product called Jira Ops and that they will acquire OpsGenie. Organizations can use Jira Ops for resolving incidents and doing post-mortems to learn from them. OpsGenie adds prompt and reliable alerting to Jira Ops.

Culture & Methods Follow 798 Followers

Psychological Safety in Post-Mortems

by Ben Linders Follow 28 Followers on  Sep 06, 2018 1

Emotions often come to the fore when there is an incident; psychological safety in blameless post-mortems is essential for the learning process to happen. The post-mortem session must be fairly moderated, preferably by an outsider, giving everyone a turn to speak without criticism. Don’t start the analysis of the incident before there is a clear and common understanding of what actually happened.

DevOps Follow 971 Followers

How ING Bank Does SRE

by Manuel Pais Follow 9 Followers on  Dec 30, 2017

Janna Brummel and Robin van Zijll, from ING Netherlands, talked at the Velocity conference in London about how poor availability from their internet banking systems prompted the bank to implement an SRE culture. A centralized SRE team was set up in the Netherlands to provide tooling, consulting and education on reliability to product teams (known as BizDevOps squads internally).

DevOps Follow 971 Followers

Post-Mortems Trends and Behaviors

by Manuel Pais Follow 9 Followers on  Nov 29, 2017

Eric Siegler presented his findings at Velocity from analyzing data from 1000 post-mortems ran by 125 different organizations over a six month period. Main trends include the prevalence of blameless post-mortems; the fact that only 1 in 100 post-mortems refer to "human error"; and that analyzing the lifecycle of incidents can provide useful insights on weaknesses in the incident response process.

DevOps Follow 971 Followers

John Willis Talks DevOps Superpatterns at DOES17 London

by Helen Beal Follow 4 Followers on  Jun 26, 2017

John Willis, co-author of The DevOps Handbook, spoke about the emerging DevOps Superpattern at the 2017 DevOps Enterprise Summit June 5th and 6th in London.

Followers

Handling Incidents and Outages

by João Miranda Follow 2 Followers on  Jun 29, 2015 2

David Mytton, CEO at Server Density, shared with the devopsdays Amsterdam 2015 crowd how they handle incidents and outages. The process is grounded on a key set of principles: frequent public updates; exhaustive logging of the response activities; team effort and effective escalation. Server Density draws a lot of inspiration from the aviation industry, renowned for its safety procedures.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT