InfoQ Homepage Presentations
-
User Simulation for Rapid Outage Mitigation
Carissa Blossom walks through the monitoring service that Uber developed to identify issues in production at the individual city level all across the globe.
-
Reduce ‘Unknown Unknowns’ across Your CI/CD Pipeline
The panelists discuss monitoring and observability methods that DevOps and SRE teams can employ to balance change and uncertainty without the need to constantly reconfigure monitoring systems.
-
Measuring Value Realization through Testing in Production
The panelists discuss what are the best patterns for testing in production and how testing in production can provide feedback that can be built back into the continuous delivery lifecycle of DevOps.
-
Building Reliability One Step at a Time
Ana Margarita Medina shares how she has been using Chaos Engineering and how it can be used to decouple our system’s weak points, learn from incidents and improve monitoring and observability.
-
Embracing Observability in Distributed Systems
Michael Hausenblas discusses good practices and current developments around CNCF open source projects and specifications including OpenTelemetry and FluentBit.
-
Production Infrastructure Cloning++: Reliability and Repeatability
JD Palomino discusses how they have developed a cloud and product-agnostic infrastructure pipeline to handle extra steps and custom configuration, with no special exceptions.
-
Helm: Past, Present, Future
Bridget Kromhout, Matt Butcher, Matt Farina discuss Helm, what they want to take it to, Helm 3 and 4.
-
Security and the Language of Intent
Tracy Holmes and Petros Kolyvas discuss why the language of security for infrastructure is often lost in translation and how policy as code can help.
-
Server-Side WASM: Today and Tomorrow
Connor Hicks explores WASM today, and the capabilities that it will have tomorrow, using the Suborbital Development Platform to illustrate how WASM modules can be used to compose server APIs.
-
Architecting for Focus, Flow, and Joy: beyond the Unicorn Project
The panelists discuss some of the most fun and least fun moments when coding, how functional programming practices have helped, and how productivity can be unleashed at a team-of-teams scale.
-
Less Mess, Less Stress: the Reliability Benefits of Custom Tools
Daniel Hochman discusses how an overreliance on vendor tooling leads to worse reliability outcomes, how Lyft lowered MTTR for its most common alerts using custom tooling, and how Clutch can help.
-
InfoQ Roundtable: Embracing Production: Make Yourself at Home
The panelists discuss operating distributed systems in production, how they embrace production, and ways to make it easier for others to onboard and keep the system up and running.