Monitoring Content on InfoQ
-
Distributed Tracing Tool Jaeger Releases Version 2 with OpenTelemetry at the Core
Version 2 of the Jaeger project, a leading open-source distributed tracing platform, has been released. The release marks a significant architectural change, bringing Jaeger and its components into the OpenTelemetry framework.
-
Stripe Rearchitects Its Observability Platform with Managed Prometheus and Grafana on AWS
Stripe replaced its observability platform, which used a third-party vendor solution, with a new architecture utilizing managed services on AWS. The company made the move due to scalability limits, reliability issues, and increasing costs while transitioning to microservices. The migration involved dual-writing metrics, translating assets, validation, and user training.
-
Leveraging eBPF for Improved Infrastructure Observability
To investigate performance in multi-tenant systems more efficiently, Netflix has been experimenting with eBPF to instrument the Linux kernel, gathering continuous, deeper insights into how processes are scheduled and detecting "noisy neighbors".
-
Meta Optimises AI Inference by Improving Tail Utilisation
Meta (formerly Facebook) has reported substantial improvements in the efficiency and reliability of its machine-learning model serving infrastructure by focusing on optimising tail utilisation.
-
Improving Mobile Test Automation with Continuous Integration, Central Logging, and Metrics Analysis
Continuous integration can enhance automated mobile testing. Test data from multiple mobile devices running parallel tests can be consolidated to support monitoring. Jira tickets from manual testing can trigger the build process, ensuring that testers have the correct software version for manual testing.
-
Apache SkyWalking v10: Application Performance Monitoring Tool for Distributed Systems
The Apache Software Foundation has released version 10 of Apache SkyWalking, an open-source observability platform designed to provide comprehensive monitoring, tracing, and analytics for distributed systems. It includes many new features and enhancements...
-
Cloudflare AI Gateway Now Generally Available
Cloudflare has recently announced that AI Gateway is now generally available. Described as a unified interface for managing and scaling generative AI workloads, AI Gateway allows developers to gain visibility and control over AI applications.
-
Honeycomb Announces Frontend Observability Tool with Improved Real User Monitoring
Honeycomb has introduced an early access program for its new tool, Honeycomb for Frontend Observability, which aims to enhance web application performance. The tool integrates OpenTelemetry instrumentation to collect and analyze Core Web Vitals data. Its main feature, the Web Launchpad, provides deep insights into site performance, helping users identify and resolve issues impacting these metrics.
-
AWS Adds Auto-Instrumentation of Python Applications to CloudWatch Application Signals
AWS announced support for auto-instrumentation of Python applications in the Amazon CloudWatch Application Signals feature. Application Signals enables the use of AWS Distro for OpenTelemetry to instrument Python applications without code changes. It also allows for the monitoring of service-level objectives (SLOs) and works with several AWS observability and monitoring tools.
-
Enhance Cloud-Based Monitoring with Azure Monitor Pipeline
Azure Monitor is Microsoft's cloud monitoring service for gathering, visualizing, and analyzing telemetry data from applications, infrastructure, and networks. The company recently added a data collection capability in preview with the edge pipeline, which enables the collection and routing of telemetry data before it's sent to the cloud.
-
Open-Source APM and Observability Tool Coroot Now GA
The open-source application performance monitoring system Coroot is now generally available. Announced as an alternative to Datadog and New Relic, the observability tool leverages eBPF for insights into system performance and supports monitoring heterogeneous infrastructures.
-
Microsoft Azure Introduces Retina: a Cloud Native Container Networking Observability Platform
The Microsoft Azure Container Networking team recently announced an open-source project named Retina. This cloud-native container networking observability platform allows DevOps engineers and administrators to visualize, debug, and analyze workload traffic across diverse environments.
-
Expedia Speeds up Flights Search with Micro Frontends and GraphQL Optimizations
Expedia made flight search up to 52% faster (measured by page usable time) by applying a range of optimizations to its web and mobile applications. To support these improvements, the company improved the observability of its applications. The Expedia Flights web application has been migrated to a Micro Frontend Architecture (MFA) to allow flexibility, reusability, and better optimization.
-
Java News Roundup: New JEP Drafts, Infinispan 15, Payara Platform, Alpaquita Containers with CRaC
This week's Java roundup for March 11th, 2024, features news highlighting: new JEP drafts, Stream Gatherers (Second Preview) and Hot Code Heap; Infinispan 15; the March 2024 edition of Payara Platform; Alpaquita Containers with CRaC; the first release candidate of JobRunr 7.0; and milestone and point releases for Spring projects, Quarkus, Helidon and Micronaut.
-
Uber Builds Scalable Chat Using Microservices with GraphQL Subscriptions and Kafka
Uber replaced a legacy architecture built using the WAMP protocol with a new solution that takes advantage of GraphQL subscriptions. The main drivers for creating a new architecture were challenges around reliability, scalability, and observability/debuggability, as well as technical debt impeding the team’s ability to maintain the existing solution.