InfoQ Homepage Monitoring Content on InfoQ
-
AWS CloudWatch Adds Observability Tool for Visualizing Distributed Applications
AWS released ServiceLens, a fully managed observability solution built within CloudWatch. ServiceLens is designed to visualize and analyze the health, performance, and availability of distributed applications. Currently it is available in all commercial regions but requires the usage of AWS X-Ray.
-
Amazon Announces AWS Firelens – a New Way to Manage Container Logs
Recently, Amazon announced a new log aggregation service called AWS Firelens. The service unifies log filtering and routing across all AWS container services including Amazon ECS, Amazon EKS, and AWS Fargate.
-
Heroku's Journey to Automated Continuous Deployment
Heroku's engineering team wrote about their journey from manual deployments to automated continuous deployments for Heroku Runtime, their managed environment for applications. They achieved this using Heroku primitives and a custom deployer tool.
-
Full Stack Monitoring of JVM Applications, Using Micrometer
Clint Checketts, core committer of Micrometer Project, recently spoke at SpringOne Platform 2019 conference about Micrometer monitoring and alerting framework.
-
Amazon Releases the Anomaly Detection Feature for CloudWatch to General Availability
Recently, Amazon announced the general availability of the Anomaly Detection feature in Amazon CloudWatch, a monitoring and management service providing customers data and insights from AWS, hybrid, and on-premises applications and infrastructure resources.
-
Amazon Releases Container Monitoring for Amazon ECS, EKS, and Kubernetes via CloudWatch
Recently, Amazon announced that customers can now monitor, isolate, and diagnose their containerized applications and microservices environments using Amazon CloudWatch Container Insights. Cloud Insights is a part of Amazon CloudWatch, a fully-managed monitoring and observability service in AWS targeted for DevOps engineers, developers, site reliability engineers (SREs), and IT managers.
-
Instana Pipeline Feedback for Release Performance
Application performance management service provider Instana launched Pipeline Feedback for release performance tracking and analysis. Pipeline Feedback provides automatic tracking of application releases, feedback on release performance, and integration with Jenkins.
-
Microsoft Releases a Preview of the Integration of Prometheus with Azure Monitor for Containers
Recently Microsoft announced the integration of Prometheus, a popular open-source metric monitoring solution and part of Cloud Native Compute Foundation, with Azure Monitor for containers. This integration is currently available in a preview stage for testing.
-
Vector Performance Monitoring Tool Adds eBPF, Unified Host-Container Metrics Support
Vector, the open source performance monitoring tool from Netflix, added support for eBPF based tools using a PCP daemon, a unified view of container and host metrics, and UI improvements.
-
Scaling, Incident Management and Collaboration at New York Times Engineering
The New York Times Engineering Team wrote about their approach to scaling and incident management against the backdrop of increased traffic during the November 2018 US midterm elections.
-
Testing Complex Distributed Systems at FT.com: Sarah Wells Shares Lessons Learned
The complexity in complex distributed systems isn’t in the code, it’s between the services or functions. Testing implies balancing finding problems versus delivering value, said Sarah Wells at the European Testing Conference. Testers often have the best understanding of what the system does; they have a good hypothesis about what went wrong, and are able to validate it pretty quickly.
-
Amazon Introduces AWS Cloud Map: "Service Discovery for Cloud Resources"
In a recent blog post, Amazon introduced a new service called AWS Cloud Map which discovers and tracks cloud resources. With the rise of microservice architectures, it has been increasingly difficult to manage dynamic resources in these architectures. But, using AWS Cloud Map, developers can monitor the health of databases, queues, microservices, and other cloud resources with custom names.
-
Grafana Adds Log Data Correlation to Time Series Metrics
The Grafana team announced an alpha version of Loki, their logging platform that ties in with other Grafana features like metrics query and visualization. Loki adds a new client agent promtail and serverside components for log metadata indexing and storage.
-
Inside Stack Overflow’s Monitoring Systems
Nick Craver, architecture lead at Stack Exchange, wrote about their monitoring systems in a recent article. He discussed the philosophy and motivation behind their monitoring strategy and talked about their toolset - mainly Bosun, Grafana and Opserver.
-
Scaling Observability at Uber: Building In-House Solutions, uMonitor and Neris
Uber’s infrastructure consists of thousands of microservices supporting mobile applications, infrastructure, and internal services. To provide high observability of these services, Uber’s Observability team built two in-house monitoring solutions: uMonitor for time-series metrics-based alerting, and Neris for host-level checks and metrics.