InfoQ Homepage application performance management Content on InfoQ
-
Chaos Engineering: Building Immunity in Production Systems
Nikhil Barthwal discusses Chaos Engineering, its purpose, how to go about it, metrics to collect, the purpose of monitoring and logging, etc.
-
Canopy: Scalable Distributed Tracing & Analysis @ Facebook
Haozhe Gao and Joe O’Neill present Canopy, Facebook’s performance and efficiency tracing infrastructure. They talk about the lessons learned and present case studies of its use.
-
Observability to Better Serverless Apps
Erica Windisch dives into how serverless development with observability tooling can help bridge the gap between operations and business intelligence to learn better and iterate faster.
-
Chick-Fil-A: Milking the Most out of 1000's of K8s Clusters
Brian Chambers and Caleb Hurd share how Chick-fil-A manages connections and deployments using two to-be-announced open source projects, and lessons learned from running Kubernetes at the Edge.
-
Observable JS Apps
Emily Nakashima talks about an event-driven approach to client-side observability for the most complicated parts of Honeycomb's customer-facing React app: the query builder.
-
Java at Speed
Gil Tene talks about getting the most of Java applications and understanding some of the optimizations the latest crop of JVMs are able to apply when running on the latest servers.
-
PCF Platform Monitoring with Prometheus and Grafana
Jamie Christian and Alan Strader discuss Northern Trust's platform monitoring solution based on Grafana, Prometheus and Alertmanager.
-
The Present and Future of Serverless Observability
Yan Cui overviews the challenges observing a serverless architecture, the tradeoffs to consider, the current state of the tooling for serverless observability, taking a look at new and coming tools.
-
Testing Observability
Amy Phillips discusses the impact of observability on testing, from new techniques, greater Dev and Ops involvement, right through to whether testing is needed anymore.
-
PerfView: The Ultimate .NET Performance Tool
Sasha Goldshtein talks about PerfView, an open source tool for .NET performance diagnosis used in production environments to solve CPU, wall-clock time, and memory usage issues.
-
How to Build Observable Distributed Systems
Pierre Vincent covers key techniques to build distributed applications, including details on useful health checks, best practices for instrumentation with metrics, logging and tracing.
-
Continuous Optimization of Microservices Using ML
Ramki Ramakrishna shares Twitter’s recent experience in applying Bayesian optimization to the performance tuning problem, discussing a service used for continuously optimizing microservices.