InfoQ Homepage Monitoring Content on InfoQ
-
Conquering Microservices Complexity @Uber with Distributed Tracing
Yuri Shkuro talks about how Uber is using distributed tracing to make sense of a large number of microservices and the interaction among them.
-
Instrumentation, Observability & Monitoring of Machine Learning Models
Josh Wills discusses the monitoring and visibility needs of machine learning models in order to bridge gaps between ML practitioners and DevOps.
-
Expect the Unexpected: How to Deal with Errors in Large Web Applications
Mats Bryntse demonstrates how to implement error monitoring in a web app and also shows how to reproduce errors without having to ask the user for a step-by-step description.
-
Spinnaker and the Distributed Monorepo
Jon Schneider presents a continuous delivery platform with application monitoring, automated canary analysis, and organization-wide code search showing how to identify and repair applications.
-
Enabling .NET Apps with Monitoring and Management Using Steeltoe
Dave Tillman discusses using the Steeltoe Management frameworks to enable a .NET application with performance monitoring, management diagnostic endpoints, and distributed tracing on PCF.
-
Restoring Confidence in Microservices: Tracing That's More Than Traces
Ben Sigelman talks about rethinking distributed tracing in terms of the most vital organizational problems that microservices introduced.
-
Jupyter Notebooks: Interactive Visualization Approaches
Chakri Cherukuri talks about how to understand and visualize machine learning models using interactive widgets and introduces the widget libraries.
-
Yes, I Test in Production (And So Do You)
Charity Majors talks about testing in production and the tools and principles of canarying software and gaining confidence in a build, also instrumentation and observability .
-
Radical Realizations with Tracing & Metric Visualizations
David Crawford, Sean Keery share insights about combining tracing data & metrics with animated traffic dashboards to convey a more comprehensive understanding of the variables in play.
-
Monitoring AI with AI
Iskandar Sitdikov discusses a solution, tooling and architecture that allows an ML engineer to be involved in delivery phase and take ownership over deployment and monitoring of ML pipelines.
-
Expect the Unexpected: How to Handle Errors Gracefully
Bastian Hoffman discusses monitoring and logging errors, showing how to handle them, covering deployment strategies with circuit breakers, and reducing functionality to minimize impact.
-
Chaos Engineering: Building Immunity in Production Systems
Nikhil Barthwal discusses Chaos Engineering, its purpose, how to go about it, metrics to collect, the purpose of monitoring and logging, etc.