InfoQ Homepage Observability Content on InfoQ
-
Kubernetes Control Plane Metrics Now Available in Google Kubernetes Engine
Google has announced the general availability of Kubernetes control plane metrics in Google Kubernetes Engine (GKE). These metrics are directly integrated with Google Cloud Monitoring providing a single solution for troubleshooting issues with GKE. Integration with third-party observability tooling is also possible via the Cloud Monitoring API.
-
Azure Managed Grafana Now Generally Available
Microsoft recently announced the general availability (GA) of Azure Managed Grafana, a managed service that enables customers to run Grafana natively within the Azure cloud platform. With the managed service, they can connect to existing Azure Services to enhance observability and cloud management.
-
Standardising Observability and Incident Management at Miro
The Miro Data Engineering team recently discussed how they systematised alerts and incident management. Along with standardising the observability metrics and alert(s) definitions, the team started using OpsGenie for incident management. This helped the team address challenges with scaling such as standard format for metric labelling, alert definitions, on-call duties, etc.
-
Programming Observability: Measuring the Maturity of Observability as Code
Observability can be programmed and automated with observability as code. A maturity model can be used to measure and improve the adoption of observability as code implementation. Yury Niño Roa, cloud infrastructure engineer at Google, spoke about programming observability at InfoQ live August 2022.
-
Dealing with Cognitive Load Using Observability
We can make good decisions with speed when we limit the cognitive load on any one person or team. Observability can help to increase delivery speed, by providing information to developers that helps them to make decisions quickly.
-
Grafana 9 Brings Big Improvements to Alerting and User Experience
Grafana, an open-source graphing tool, has reached its version 9 release. The key goals behind version 9 are improving the user experience, making observability and data visualization easy and accessible, and improving alerting.
-
AWS Lambda Powertools for TypeScript Now Generally Available
Amazon recently announced the general availability (GA) of AWS Lambda Powertools TypeScript. The set of utilities for Node.js Lambda functions helps developers follow best practices for tracing, structured logging, and custom metrics.
-
Applying Observability to Increase Delivery Speed and Flow in Teams
When we design team and departmental processes, we want to know what’s happening in the software teams. Asking team members to provide information or fill in fields in tools adds a burden and distorts reality. Setting up observability in the software can provide alternative insights in a less intrusive way. Observability in the software can be an asset to organizing teams.
-
Microsoft Launches the Public Preview of Dynatrace for Azure as a SaaS Solution in Their Marketplace
Microsoft recently announced Dynatrace for Azure, a natively integrated software (SaaS) solution from Dynatrace available in preview in the Azure Marketplace.
-
New Observability Features for AWS Step Functions
Recently, AWS announced a new opt-in capability on the state-machine page for Step Functions. Developers can quickly analyze, debug, and optimize Standard Workflows by enabling the new execution page option.
-
Full-Stack Observability with Grafana and Azure Monitor
Microsoft recently introduced Azure Managed Grafana in preview, including new Grafana integrations with Azure Monitor. With Azure Managed Grafana, customers can now view their Azure monitoring data in Grafana dashboards and have new out-of-the-box Azure Monitor dashboards.
-
Amazon Managed Service for Prometheus Now GA: Alerting, PromQL Support, and More
Recently AWS announced the general availability (GA) of Amazon Managed Service for Prometheus (AMP), a Prometheus-compatible monitoring service for container infrastructure and application metrics for containers.
-
Litmus 2.0 Release Includes Multi-Tenancy, Chaos Workflows, GitOps, and Observability
Last month, Litmus 2.0 was released for general availability, with the goal of simplifying chaos engineering by adding new features like chaos center, chaos workflows, GitOps for chaos, multi-tenancy, observability, and private chaos hubs. InfoQ interviewed Umasankar Mukkara, CEO of ChaosNative and co-creator and maintainer of Litmus engineering platform.
-
The eBPF Foundation Aims to Further Advance eBPF Features and Adoption
eBPF, a technology used to extend the Linux kernel capabilities without requiring to change its code or reload kernel modules, now has its own foundation hosted within the Linux Foundation, announce Facebook, Google, Isovalent, and other founding members.
-
AWS Announces the General Availability of Lambda Extensions
Recently AWS announced Lambda extensions' general availability (GA), which allows its users to integrate Lambda functions with their preferred tooling for monitoring, observability, security, and governance. The GA release includes performance improvements and an expanded set of partners.