Monitoring Content on InfoQ
-
Honeycomb Announces Frontend Observability Tool with Improved Real User Monitoring
Honeycomb has introduced an early access program for its new tool, Honeycomb for Frontend Observability, which aims to enhance web application performance. The tool integrates OpenTelemetry instrumentation to collect and analyze Core Web Vitals data. Its main feature, the Web Launchpad, provides deep insights into site performance, helping users identify and resolve issues impacting these metrics.
-
AWS Adds Auto-Instrumentation of Python Applications to CloudWatch Application Signals
AWS announced support for auto-instrumentation of Python applications in the Amazon CloudWatch Application Signals feature. Application Signals uses AWS Distro for OpenTelemetry to instrument Python applications without code changes. It also allows monitoring of service-level objectives (SLOs) and works with several AWS observability and monitoring tools.
-
Enhance Cloud-Based Monitoring with Azure Monitor Pipeline
Azure Monitor is Microsoft's cloud monitoring service for gathering, visualizing, and analyzing telemetry data from applications, infrastructure, and networks. The company recently added a data collection capability in preview with the edge pipeline, which enables the collection and routing of telemetry data before it's sent to the cloud.
-
Open-Source APM and Observability Tool Coroot Now GA
The open-source application performance monitoring system Coroot is now generally available. Announced as an alternative to Datadog and New Relic, the observability tool leverages eBPF for insights into system performance and supports monitoring heterogeneous infrastructures.
-
Slack Conquers Deployment Fears with Z-score Monitoring
An engineer at team communication platform Slack has written about confronting their fear of deployments and successfully implementing a bot to monitor them instead. Describing a seemingly scary delegation of responsibility, Sean McIlroy explains the reasoning and logic behind giving a pivotal role to their ReleaseBot in a detailed blog post.
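To illustrate the idea named in the headline, here is a minimal sketch of a z-score check on a deployment metric. It is not Slack's ReleaseBot logic: the function names, the sample data, and the threshold of 3 standard deviations are assumptions chosen for illustration.

```python
from statistics import mean, stdev


def z_score(history, latest):
    """Return how many sample standard deviations `latest` lies from the mean of `history`."""
    if len(history) < 2:
        raise ValueError("need at least two historical samples")
    mu = mean(history)
    sigma = stdev(history)
    if sigma == 0:
        # A perfectly flat history: any deviation at all is infinitely unusual.
        if latest == mu:
            return 0.0
        return float("inf") if latest > mu else float("-inf")
    return (latest - mu) / sigma


def is_anomalous(history, latest, threshold=3.0):
    """Flag `latest` when its absolute z-score exceeds `threshold` (an assumed default)."""
    return abs(z_score(history, latest)) > threshold


# Hypothetical error counts per minute before a deployment.
baseline = [10, 12, 11, 9, 10, 11, 12, 10]
print(is_anomalous(baseline, 11))  # a value close to the baseline
print(is_anomalous(baseline, 25))  # a value far above the baseline
```

A bot built on this kind of check would compare metrics sampled after a deployment against the pre-deployment baseline and pause or alert when the check fires.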
-
CNCF Survey: Half of Organizations Spend More with Kubernetes, Mostly Due to Overprovisioning
CNCF published the results of its latest microsurvey report on cloud-native FinOps and cloud financial management (CFM). Kubernetes has driven cloud spending up for 49% of respondents, while 28% stated their costs remain unchanged and 24% saved after migrating to Kubernetes. Respondents listed overprovisioning, lack of awareness and responsibility, and sprawl as the main factors for overspending.
-
AI and FinOps Predicted to Lead Observability Innovation in 2024
In recently published articles, three large observability companies have made predictions about the trends we will see in observability in 2024 and beyond. These contributions suggest that AI integration, FinOps, OpenTelemetry, and security and governance will significantly shape observability in the year ahead.
-
Azure Monitor Metrics Data Plane API for Enhanced Resource Insight Gathering
Microsoft recently announced the general availability (GA) of the Azure Monitor Metrics Data Plane API, which makes resource insight gathering more efficient. Developers can retrieve metric data for up to 50 resource IDs in the same subscription and region in a single batch API call, improving query throughput and reducing the risk of throttling.
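The 50-resource limit means a client with more resources has to split its requests. The following is a minimal sketch of that splitting step only. It does not call the Azure API, the helper name and resource IDs are hypothetical, and it assumes the caller has already grouped the IDs by subscription and region.

```python
def batch_resource_ids(resource_ids, batch_size=50):
    """Split resource IDs into lists of at most `batch_size` items, preserving order."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    ids = list(resource_ids)
    return [ids[i:i + batch_size] for i in range(0, len(ids), batch_size)]


# Hypothetical resource IDs, all assumed to share one subscription and region.
resource_ids = [f"/subscriptions/example/resourceGroups/rg/vm-{n}" for n in range(120)]
batches = batch_resource_ids(resource_ids)
print([len(batch) for batch in batches])
```

Each resulting list could then be sent as one batch call, so 120 resources need three calls instead of 120.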
-
Amazon CloudWatch Alarms Can Now Directly Trigger Lambda Functions
AWS recently announced that Amazon CloudWatch alarms now support AWS Lambda functions as an action for state changes. This new feature enables developers to automate remediation actions when detecting an unhealthy resource.
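As a rough sketch of what such an automated remediation hook might look like, the handler below reacts to an alarm state change. The event layout used here (an `alarmData` object containing `alarmName` and `state.value`) is an assumption that should be checked against the AWS documentation, and `remediate` is a hypothetical placeholder rather than a real AWS call.

```python
def remediate(alarm_name):
    """Hypothetical placeholder: a real function might restart or replace a resource."""
    return f"remediation requested for {alarm_name}"


def lambda_handler(event, context):
    """Act only when the alarm has entered the ALARM state; otherwise do nothing."""
    alarm = event.get("alarmData", {})          # assumed event field
    name = alarm.get("alarmName", "unknown")    # assumed event field
    state = alarm.get("state", {}).get("value")  # assumed event field
    action = remediate(name) if state == "ALARM" else "none"
    return {"alarm": name, "state": state, "action": action}


# Local invocation with a hand-written sample event (not captured from AWS).
sample_event = {"alarmData": {"alarmName": "HighCPU", "state": {"value": "ALARM"}}}
print(lambda_handler(sample_event, None))
```

Keeping the state check inside the handler means the same function can be attached to several alarm state transitions without triggering remediation on recovery.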
-
AWS Improves Kubernetes Monitoring with New Enhanced Observability for Amazon EKS
AWS has released several monitoring and observability improvements. The releases include Amazon CloudWatch Container Insights with Enhanced Observability for Amazon EKS on EC2, a fully managed service that provides visibility into containerized workloads. Other releases include a new Amazon Managed Service for Prometheus collector and Amazon Managed Grafana community plugins.
-
Grafana Cloud Kubernetes Monitoring with Machine Learning Predictions
Managing cloud costs can be challenging as Kubernetes fleets scale. To address this issue, Grafana Cloud has introduced a cost-monitoring feature within Kubernetes Monitoring. In particular, Grafana Cloud’s Kubernetes Monitoring now offers ML predictions for CPU and memory usage.
-
eBPF Kubernetes Security Tool Tetragon Improves Performance and Stability
Isovalent has announced the 1.0 release of Cilium Tetragon, their eBPF-based Kubernetes security observability and runtime enforcement tool. Policies and filters can be applied directly via eBPF to monitor process execution, privilege escalations, and file and network activity.
-
Monzo Employs Targeted Traffic Shedding against Stampeding Herd Effect from the Mobile App
Monzo developed a solution for shedding traffic in case its platform comes under intense and unexpected load that could lead to an outage. Traffic spikes can be generated by the mobile app and triggered by push notifications or other bursts in user activity. The solution can reduce the read traffic by almost 50% with 90% overall accuracy without noticeable customer impact.
-
Contentsquare Uses Microservices and Apache Kafka for Notification Delivery
Contentsquare needed notification functionality for many use cases within its platform. The company created a generic solution spanning multiple services as part of its microservice architecture. During the implementation, the developers had to improve observability and overcome some scalability challenges.
-
Grafana Introduces ML Tool Sift to Improve Incident Response
Grafana Labs has introduced "Sift," a feature for Grafana Cloud designed to enhance incident response management (IRM) by automating system checks and expediting issue resolution. Sift automates various aspects of incident investigation and provides insights into potential issues within Kubernetes environments, helping engineers focus on resolving incidents.