InfoQ Homepage Observability Content on InfoQ
-
LinkedIn Leverages GitHub Actions, CodeQL, and Semgrep for Code Scanning
LinkedIn has rebuilt its static application security testing (SAST) pipeline using GitHub Actions and custom workflows, enabling consistent, enforceable code scanning across thousands of repositories. The redesign improves security coverage, developer workflow, and observability while supporting the company’s shift-left strategy.
-
Datadog Integrates Google Agent Development Kit into LLM Observability Tools
Datadog recently announced that its LLM Observability platform now provides automatic instrumentation for applications built with Google's Agent Development Kit (ADK), offering deeper visibility into the behavior, performance, cost, and safety of AI-driven agentic systems.
-
Airbnb Expands Global Checkout with “Pay as a Local,” Scaling to 220 Markets in 14 Months
Airbnb expands its global checkout with the “Pay as a Local” initiative, supporting over 20 locally preferred payment methods across 220 markets. The company replatformed its payments system with domain-oriented services, reusable flow archetypes, and a centralized configuration, enhancing integration speed, reliability, testing, and observability for diverse payment methods worldwide.
-
Uber Gets Ready for AI in Network Observability with Cloud Native Overhaul
Transportation company Uber has publishing a detailed account of its new observability platform on its blog, highlighting that for them, network visibility is now a strategic capability rather than a set of discrete monitoring tools.
-
Railway Highlights the Importance of Logs, Metrics, Traces, and Alerts for Diagnosing System Failure
Railway’s engineering team published a comprehensive guide to observability, explaining how developers and SRE teams can use logs, metrics, traces, and alerts together to understand and diagnose production system failures.
-
HL is a Fast, Rust-Based JSON Log Viewer Offering up to 2GiB/s Parsing Speed
Open-source log viewer hl is designed for efficient processing of structured logs in JSON or logfmt format. Built in Rust, it provides fast indexing and parsing, enabling to scan very large log files quickly, whether they are uncompressed or compressed.
-
Grab Adds Real-Time Data Quality Monitoring to Its Platform
Grab updated its internal platform to monitor Apache Kafka data quality in real time. The system uses FlinkSQL and an LLM to detect syntactic and semantic errors. It currently tracks 100+ topics, preventing invalid data from reaching downstream users. This proactive strategy aligns with industry trends to treat data streams as reliable products.
-
AWS Distributed Tracing Service X-Ray Transitions to OpenTelemetry
AWS recently announced that AWS X-Ray is transitioning to OpenTelemetry as its primary instrumentation standard for application tracing, with the AWS X-Ray SDKs and Daemon moving to maintenance mode.
-
Groundcover Takes Aim at Datadog with Observability Migration Tool
Observability platform company Groundcover has launched a new migration tool to help organisations move their observability stacks from other vendors (such as Datadog) to its own platform. The company is claiming that organisations can migrate metrics, dashboards and monitors with full automation, and without needing any downtime nor consultants.
-
Grafana Unveils Smarter Logs, an MCP Server, and TraceQL Upgrades in Latest Releases
Grafana Labs has published major updates across two of its core observability products: Grafana 12.3, and Grafana Tempo 2.9. The two releases have distinct improvements in monitoring, logs, and tracing for Grafana users.
-
Grafana Labs Releases Mimir 3.0 with Redesigned Architecture for Enhanced Performances
Grafana Labs has released Grafana Mimir 3.0. This is a significant advancement for the open-source, horizontally scalable time series database. The release features a new design that separates read and write operations. This change greatly boosts performance, reliability, and cost efficiency for organizations handling metrics at scale.
-
Inside Uber’s Query Architecture: Simplifying Layers and Improving Observability
Uber rebuilt its Apache Pinot query architecture, replacing the Presto-based Neutrino system with a lightweight proxy called Cellar and Pinot’s Multi-Stage Engine Lite Mode. The redesign simplifies SQL execution, improves resource management, and ensures predictable performance for large-scale analytics workloads.
-
QCon London 2026 Announces Tracks: AI Engineering, Building Teams, Tech of Finance, and More
The QCon London 2026 tracks are live: 15 practitioner-curated deep dives on AI adoption, resilient architectures, distributed systems, performance, modern languages, data, security, and Staff+ leadership, rooted in real production lessons.
-
Flipkart Scales Prometheus to 80 Million Metrics Using Hierarchical Federation
Flipkart engineers recently published a detailed case study describing how they overcame severe scalability limits in monitoring by adopting a hierarchical federation design in Prometheus.
-
Vercel Introduces Drains for Unified Data Export
Vercel has released Vercel Drains, a system for exporting observability data from its platform into external services. The feature unifies logs, distributed traces, web analytics events, and performance metrics into a single streaming mechanism.