InfoQ Homepage Observability Content on InfoQ
-
The Mechanics of Metrics: Aggregation across Dimensions
Erin Schnabel discusses how application metrics align with other observability and monitoring methods, from profiling to tracing, and the limits of aggregation.
-
Panel: Observability and Understandability
Jason Yee, John Egan, and Ben Sigelman discuss their approaches and preferred methods to get impactful results in incident management, distributed tracing, and chaos engineering.
-
Resources & Transactions: a Fundamental Duality in Observability
Ben Sigelman explores resources and transactions, both theoretically and through some real-world examples, to develop an intuition for how to understand a system more completely.
-
Reduce ‘Unknown Unknowns’ across Your CI/CD Pipeline
The panelists discuss monitoring and observability methods that DevOps and SRE teams can employ to balance change and uncertainty without the need to constantly reconfigure monitoring systems.
-
Embracing Observability in Distributed Systems
Michael Hausenblas discusses good practices and current developments around CNCF open source projects and specifications including OpenTelemetry and FluentBit.
-
True Observability Needs High-Cardinality
Pierre Vincent discusses how high-cardinality observability helps the exploration and debugging power required to understand the reality of a production system.
-
Solving Mysteries Faster with Observability
Elizabeth Carretto discusses observability at Netflix and where and how their internal tool, Edgar, comes into play.
-
InfoQ Live Roundtable: Observability Patterns for Distributed Systems
The panelists explore how a sound observability strategy can help mitigate operational costs and avoid common pitfalls in monitoring distributed systems.
-
Observability in the Development Process: Not Just for Ops Anymore
Christine Yen explores what observability looks like in practice, so that production stops being just where the development code runs into issues.
-
Architectures That Scale Deep - Regaining Control in Deep Systems
Ben Sigelman talks about "Deep Systems" and their common properties: they are layered, distributed, concurrent, multi-tenant, change continuously, and are hard to manage with conventional tools.
-
Yes, I Test in Production (And So Do You)
Charity Majors talks about testing in production and the tools and principles of canarying software and gaining confidence in a build, also instrumentation and observability .
-
Observability to Better Serverless Apps
Erica Windisch dives into how serverless development with observability tooling can help bridge the gap between operations and business intelligence to learn better and iterate faster.