InfoQ Homepage Performance Content on InfoQ
-
The Mechanics of Metrics: Aggregation across Dimensions
Erin Schnabel discusses how application metrics align with other observability and monitoring methods, from profiling to tracing, and the limits of aggregation.
-
Panel: Kubernetes at Web Scale on the Cloud
The panelists discuss what they have learned scaling their own workload in the public cloud. Topics include capacity and workload management, security integration, and homegrown PaaS integration.
-
Authorization at Netflix Scale
Travis Nelson discusses Netflix’s approach to scaling and shares techniques for distributed caching and isolating failure domains.
-
Software Engineering – Then, Now, and Next
Mary Poppendieck discusses how software engineering has been changed by the scale and speed required of digital companies in the past, now, and in the future.
-
Panel: Observability and Understandability
Jason Yee, John Egan, and Ben Sigelman discuss their approaches and preferred methods to get impactful results in incident management, distributed tracing, and chaos engineering.
-
Resources & Transactions: a Fundamental Duality in Observability
Ben Sigelman explores resources and transactions, both theoretically and through some real-world examples, to develop an intuition for how to understand a system more completely.
-
Building and Scaling a Control Plane for 1000s of Kafka Clusters
Gwen Shapira and Vivek Sharma discuss some architectural highlights of building, evolving and scaling a control plane for thousands of Kafka clusters, and some challenges encountered.
-
Optimizing Your Web Performance: Separating the Signals from the Noise
Carl Anderson shares the journey Trainline has been on leading up to Google introducing Core Web Vitals as a ranking signal, discussing web performance.
-
Software Supply Chains for DevOps
Aysylu Greenberg discusses what needs to be collected to allow DevOps to inspect and verify the integrity of the supply chain, some of the existing solutions and open problems in this space.
-
User Simulation for Rapid Outage Mitigation
Carissa Blossom walks through the monitoring service that Uber developed to identify issues in production at the individual city level all across the globe.
-
Reduce ‘Unknown Unknowns’ across Your CI/CD Pipeline
The panelists discuss monitoring and observability methods that DevOps and SRE teams can employ to balance change and uncertainty without the need to constantly reconfigure monitoring systems.
-
Embracing Observability in Distributed Systems
Michael Hausenblas discusses good practices and current developments around CNCF open source projects and specifications including OpenTelemetry and FluentBit.