InfoQ Homepage Site Reliability Engineering Content on InfoQ
Podcasts
RSS Feed-
Developer-First Observability with Micha “Mies” Hernandez van Leuffen
In this episode, Thomas Betts talks with Micha “Mies” Hernandez van Leuffen about observability and incidents, and the roles of developers, SREs and other team members. One challenge is knowing what metrics to track in the first place. A developer-first approach to observability means focusing on metrics that are specific to your application.
-
Tammy Bryant Butow on SRE Apprentices
In this episode, Thomas Betts speaks with Tammy Bryant Butow, principal SRE at Gremlin, about training new site reliability engineers. The discussion covers a formal SRE Apprenticeship program Butow led at DropBox, and gets into ideas about the best way to teach people new technical skills.
-
Johnny Boursiquot on Serverless Go and Site Reliability Engineering at Heroku
In this podcast, Johnny Boursiquot, Site Reliability Engineer at Heroku, sat down with InfoQ podcast co-host Daniel Bryant and discussed topics that included: why Go is a useful language for building Function-as-a-Service (FaaS) style applications; how Heroku implements the role of Site Reliability Engineer (SRE); and why the ability to teach is such a valuable skill.
-
Tanya Reilly on Site Reliability Engineering and the Evolution of the New York City Fire Code
Tanya Reilly discusses her research into how the fire code evolved in New York and draws on some of the parallels she sees in software. Along the way, she discusses what it means to be an SRE, what effective aspects of the role might look like, and her opinions on what we as an industry should be doing to prevent disasters.
-
Hiring and Growing Great Site Reliability Engineers
In this podcast Shane Hastie, Lead Editor for Culture & Methods spoke to Narayanan Raghavan, Senior Director for Site Reliability Engineering for Managed Services at Red Hat, about hiring and growing