InfoQ Homepage Monitoring Content on InfoQ
-
Kubernetes Control Plane Metrics Now Available in Google Kubernetes Engine
Google has announced the general availability of Kubernetes control plane metrics in Google Kubernetes Engine (GKE). These metrics are directly integrated with Google Cloud Monitoring providing a single solution for troubleshooting issues with GKE. Integration with third-party observability tooling is also possible via the Cloud Monitoring API.
-
Grafana 9 Brings Big Improvements to Alerting and User Experience
Grafana, an open-source graphing tool, has reached its version 9 release. The key goals behind version 9 are improving the user experience, making observability and data visualization easy and accessible, and improving alerting.
-
Effectively Monitoring Your Monitoring - Miedwar Meshbesher on Using Vigilance Controls
With many open-source and paid tools available to do the job, it can be relatively straightforward to make sure that your systems are monitored properly. But, how does a team make sure that these systems are working as described, and alert the team effectively that there’s a problem with the system that is supposed to be keeping an eye on things?
-
Full-Stack Observability with Grafana and Azure Monitor
Microsoft recently introduced Azure Managed Grafana in preview, including new Grafana integrations with Azure Monitor. With Azure Managed Grafana, customers can now view their Azure monitoring data in Grafana dashboards and have new out-of-the-box Azure Monitor dashboards.
-
Service Overload Detection and Remediation at LinkedIn
LinkedIn recently published how it handles overload detection and remediation in its microservices. Its solution, Hodor, provides an adaptive solution that works out of the box with no configuration. It is a platform-agnostic mechanism to run overload detectors and load shedders inside the monitored process that samples load and sheds traffic from within the application's processing chain.
-
Google Cloud Python Logging Library Release Improves Serverless Support
Google has announced version 3.0.0 of their Google Cloud Python logging library. The release brings a number of new features including enhanced support for Cloud Run and Cloud Functions, support for string JSON payloads, and automated metadata attachments.
-
NGINX Controller Application Delivery Modules Improve Health Checks and Caching Configurations
NGINX has released new versions of their NGINX Controller Application Delivery Module, a control plane solution for NGINX Plus load balancers. The new features include enhanced workload health-checks, improvements to caching configuration, and instance groups.
-
Microsoft Open-Sources GCToolkit to Tap into JVM GC Logs
By open sourcing GCToolkit, Microsoft promises to tap into the rich information stored within the JVM’s GC logs. The tool promises to provide mechanisms to parse, analyse and visualise the GC log files in a human friendly manner.
-
Datadog Publishes Best Practices for Monitoring Dark Launches
Datadog recently published the best practices for monitoring dark launches. The blog post includes a detailed description of dark launches, the different types of metrics and dashboards, as well as the best practices for monitoring them.
-
Amazon Introduces Cloudwatch Cross Account Alarms to Consolidate Management
Amazon CloudWatch recently announced cross account alarms, a new feature that enables customers to set alerts and take actions based on changes to metrics across different AWS accounts.
-
Need Help Tracking Cloud Emissions? Microsoft Previews Microsoft Cloud for Sustainability
At the recent Inspire 2021 conference, Microsoft announced the preview of Microsoft Cloud for Sustainability, a new service to help companies measure and manage their carbon emissions, set sustainability goals and take measurable action.
-
OpenTelemetry Specification Reaches 1.0 with Stability Guarantees and New Release Candidates
The OpenTelemetry specification has been promoted to v1.0.0. This milestone includes improved stability and backwards compatibility guarantees, as well as API and SDK release candidates available for a number of languages. With this release, both the tracing API and the tracing SDK are considered stable.
-
AWS Adds Distributed Tracing to Their Elasticsearch Service
Amazon has announced the addition of Trace Analytics to their Amazon Elasticsearch Service. Trace Analytics adds distributed tracing to their service with support for OpenTelemetry. This new feature also integrates with the AWS Distro for OpenTelemetry.
-
Lightstep Connects Tracing and Metrics with New Change Intelligence Feature
Lightstep has released a number of improvements to their observability platform. These include native support for OpenTelemetry metrics, a new underlying time series database, and Change Intelligence, a new feature that looks to connect unusual patterns with impacting changes by bringing together system metrics and trace data.
-
Complimentary InfoQ Live Feb 16 Roundtables. Discover Valuable Insights to Implement Immediately
InfoQ Live, the one-day event for developers and engineers, is only a week away (Feb 16). Grab your ticket and deep-dive into practical ways you can use and integrate observability into your distributed system architecture.