InfoQ Homepage Observability Content on InfoQ
-
New Open Source Tool Subtrace Brings Network Analysis to Container Environments
Y Combinator startup Subtrace has released an open-source tool to help analyse network traffic from containerised applications. The creators have positioned it as "Wireshark for Containers " and aim to simplify network debugging in Docker and Kubernetes environments.
-
Vercel Adds External API Caching Analytics to Observability
Vercel has enhanced its observability platform by integrating external API caching insights, enabling developers to track how many requests to third-party APIs are served from the Vercel Data Cache versus being routed to the origin server.
-
Inside Netflix’s Title Launch Observability System: Validating Title Availability at Global Scale
Netflix has developed a platform called Title Launch Observability, which shifts observability from system health to product intent. Instead of relying solely on logs and metrics, the system validates launches against what users should see, catching content quality issues early. The platform helps detect issues such as missing artwork, incorrect recommendations, or localization gaps.
-
Logz.io and Dynatrace Innovations Shift Observability into the AI Age
Major observability platform providers are integrating artificial intelligence into their monitoring systems, as enterprises look to their suppliers to reduce the manual work involved in keeping an eye on digital infrastructure. Companies have implemented AI-driven features designed to automate routine operational tasks and accelerate incident resolution processes.
-
AWS Lambda Gains Native Avro and Protobuf Support for Kafka Events with Schema Registry Integration
Lambda now natively supports Apache Avro and Protobuf events, streamlining Kafka event processing - an enhancement that eliminates the need for custom deserialization, automates schema validation and filtering, and optimizes costs through efficient event handling. Integration with AWS Glue and Confluent registries simplifies development, allowing cleaner data consumption and enhanced scalability.
-
Microsoft Azure Enhances Observability with OpenTelemetry Support for Logic Apps and Functions
Microsoft has expanded OpenTelemetry support in Azure Logic Apps and Functions, enhancing observability and interoperability across platforms. This open-source framework enables seamless data generation and correlation, enhancing diagnostics beyond standard telemetry. With streamlined configuration and integration, Azure's offerings aim for standardized observability across cloud services.
-
AWS Shield Network Security Director: Network Topology Visibility and Remediation Guidance
Introducing AWS Shield Network Security Director: a game-changer in DDoS protection and network security visibility. This innovative feature automates resource discovery, evaluates configurations against best practices, and prioritizes security findings. With actionable remediation steps and natural language queries via Amazon Q Developer, organizations can enhance their security posture.
-
Amazon API Gateway Adds Dynamic Routing Based on Headers and Paths
AWS's new dynamic routing rules for Amazon API Gateway empower developers to streamline API traffic management by routing requests based on HTTP headers without complex URL structures. This innovative feature simplifies API versioning, enables fine-grained control, enhances A/B testing, and improves request visibility, making API configurations more efficient and user-friendly.
-
Applying Observability to Leadership to Understand and Explain your Way of Working
Leadership observability means observing yourself as you lead, treating yourself as the system that is under observation. Alex Schladebeck shared how narrating thoughts, using mind maps, asking questions, and identifying patterns helped her as a leader to explain decisions, check bias, support others, and understand her actions and challenges.
-
AWS Launches EKS Dashboard to Tackle Multi-Cloud Kubernetes Complexity
Introducing the Amazon EKS Dashboard: a centralized management tool delivering unified visibility across multiple Kubernetes clusters in AWS. Simplifying operational oversight, it offers insights on resource distribution, health metrics, and cost forecasting. Designed for ease, it enhances compliance checks and empowers strategic planning with data-driven insights.
-
Travel Giant Skyscanner Overhauls Observability, Cuts Telemetry Costs by 90%
The COVID-19 pandemic gave the engineering teams at travel search giant Skyscanner an opportunity to introspectively examine their observability stack. Skyscanner has written about how it has overhauled its approach to technical observability with a system that improves reliability for engineers and travellers alike.
-
AWS Lambda Introduces Tiered Pricing for CloudWatch Logs and Expands Logging Destinations
AWS enhances Lambda logging with tiered pricing for Amazon CloudWatch Logs, effective May 1, 2025, reducing costs for high-volume deployments. New destinations like Amazon S3 and Firehose simplify integration and enable advanced analytics. These changes promise significant savings and flexibility for AWS users while emphasizing the need for optimized logging strategies.
-
How Observability Can Improve the UX of LLM Based Systems: Insights of Honeycomb's CEO at KubeCon EU
During her KubeCon Europe keynote, Christine Yen, CEO and co-founder of Honeycomb, provided insights on how observability can help cope with the rapid shifts introduced by the integration of LLMs in software systems, which transformed not only the way we develop software but also the release methodology. She explained how to adapt your development feedback loop based on production observations.
-
Grafana Loki Introduces v3.4 with Standardized Storage and Unified Telemetry
Grafana Loki recently introduced their version 3.4, which includes enhancements aimed at improving the efficiency and log management standardization. One of the key updates is the integration of the Thanos Object Storage Client, which aligns Loki's storage configuration with other Grafana databases, such as Mimir and Pyroscope.
-
Traefik v3.3 Release: Enhanced Observability and Documentation
TraefikLabs recently announced the latest release of Traefik Proxy v3.3 (codenamed "saint-nectaire” after a French cheese). This release focuses primarily on two critical areas: observability capabilities and improved documentation structure. These enhancements aim to make the popular open-source reverse proxy even more powerful for platform engineers working in complex cloud-native environments.