DevOps Content on InfoQ
-
Discord Engineers Add Distributed Tracing to Elixir's Actor Model without Performance Penalty
Discord engineering detailed how they added distributed tracing to Elixir's actor model. Their custom Transport library wraps messages with trace context and uses dynamic sampling to handle million-user fanouts. CPU optimizations included skipping unsampled traces and filtering context before deserialization, recovering 10+ percentage points of overhead.
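The wrap-and-sample idea can be sketched as follows. This is not Discord's Elixir library: it is a minimal Python illustration, and every name in it (`TraceContext`, `Envelope`, `sample_rate`, `wrap`, the `target` parameter) is hypothetical. The sampling rule, which keeps roughly a fixed number of traced recipients per fanout, is an assumption chosen for illustration, not the rule described by Discord.

```python
import random
from dataclasses import dataclass
from typing import Any, Optional


@dataclass(frozen=True)
class TraceContext:
    """Hypothetical trace context carried alongside a message."""
    trace_id: str
    span_id: str
    sampled: bool


@dataclass(frozen=True)
class Envelope:
    """A message payload plus optional trace context."""
    payload: Any
    trace: Optional[TraceContext] = None


def sample_rate(fanout: int, target: int = 10) -> float:
    """Assumed rule: aim for about `target` traced recipients per fanout."""
    return min(1.0, target / max(fanout, 1))


def wrap(payload: Any, ctx: Optional[TraceContext], fanout: int,
         rng: Optional[random.Random] = None) -> Envelope:
    """Attach trace context only when the trace is sampled and this
    recipient is selected by the fanout-dependent sample rate."""
    rng = rng or random.Random()
    # Skip unsampled traces entirely: no context is attached, so the
    # receiver has nothing extra to decode.
    if ctx is None or not ctx.sampled:
        return Envelope(payload)
    # Large fanouts propagate context to only a small fraction of recipients.
    if rng.random() >= sample_rate(fanout):
        return Envelope(payload)
    return Envelope(payload, ctx)
```

With these assumptions, a message sent to a single recipient always carries its sampled context, while a message fanned out to a million recipients carries it for roughly ten of them.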
-
HashiCorp Vault 1.21 Brings SPIFFE Authentication, Granular Secret Recovery, and More
HashiCorp has released Vault 1.21. This version introduces native SPIFFE authentication for non-human workloads, expands the granular secret recovery model introduced in Vault 1.20, and adds KV v2 secret attribution, MFA TOTP self-enrollment, a Vault Secrets Operator CSI driver that mounts secrets directly into pods without persisting them in etcd, and more.
-
"Pick and Mix" Custom Regions: Cloudflare Introduces Fine-Grained Data Residency Control
Cloudflare recently introduced Custom Regions, an expansion of its Regional Services that lets customers precisely define where their data is processed. By selecting specific groups of data centers by country or region, customers can ensure that TLS termination and application-layer processing remain within chosen geographic boundaries for compliance and control.
-
QCon London 2026: AI Agents Write Your Code. What’s Left for Humans?
Hannah Foxwell began her QCon London 2026 talk by noting that the long-sought velocity in development has arrived, but the industry is unsure how to use it. She set aside the technical details of agentic coding, focusing instead on its implications for the people working with these systems.
-
Airbnb Rebuilt Alert Development after Discovering it Wasn’t a Culture Problem
Airbnb has revealed how it significantly improved its observability practices by rethinking how alerts are developed and validated, concluding that what appeared to be a "culture problem" was actually a tooling and workflow gap.
-
AWS S3 Introduces Account-Regional Namespaces, Ending 18 Years of Global Bucket Name Collisions
AWS introduced account-regional namespaces for S3, fixing global bucket name collisions that broke IaC automation for 18 years. The new format is {prefix}-{account-id}-{region}-an. CloudFormation gets the BucketNamePrefix property, and IAM gets the s3:x-amz-bucket-namespace condition key. This prevents confused-deputy attacks because bucket names cannot be predicted without knowing the account ID.
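As a small illustration of the naming format quoted above, the following Python sketch assembles a name of the shape {prefix}-{account-id}-{region}-an. The function name is hypothetical and this is not an AWS API. The 12-digit account ID check reflects how AWS account IDs are formed. The 63-character cap is the limit for general purpose bucket names and is assumed here to apply to the new format as well.

```python
def account_regional_bucket_name(prefix: str, account_id: str, region: str) -> str:
    """Build a name in the {prefix}-{account-id}-{region}-an shape."""
    # AWS account IDs are 12-digit numbers.
    if not (account_id.isascii() and account_id.isdigit() and len(account_id) == 12):
        raise ValueError("account_id must be a 12-digit AWS account ID")
    name = f"{prefix}-{account_id}-{region}-an"
    # Assumption: the usual 63-character bucket name limit still applies.
    if len(name) > 63:
        raise ValueError("bucket name exceeds 63 characters")
    return name


print(account_regional_bucket_name("logs", "123456789012", "us-east-1"))
```

Because the account ID and region are fixed parts of the name, only the prefix has to be unique within a given account and region under this scheme.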
-
AWS Load Balancer Controller Reaches GA with Kubernetes Gateway API Support
AWS shipped GA support for Kubernetes Gateway API in its Load Balancer Controller, replacing annotation-based configuration with type-safe CRDs and proper validation. The release handles both L4 (TCP/UDP via NLB) and L7 (HTTP/gRPC via ALB) routing through the Gateway API spec. Teams get cross-namespace routing, automatic certificate discovery, and role separation without cluster-admin permissions.
-
QCon London 2026: Shielding the Core: Architecting Resilience with Multi-Layer Defenses
Anderson Parra, staff software engineer at SeatGeek, presented “Shielding the Core: Architecting Resilience with Multi-Layer Defenses” at QCon London 2026. Parra discussed strategies for handling significant traffic spikes that can overwhelm even a well-designed infrastructure.
-
Revenium Unveils Tool Registry to Expose the True Cost of AI Agents
Revenium has announced the general availability of its Tool Registry, a new capability designed to give enterprises a complete, end-to-end view of what their AI agents actually cost.
-
QCon London 2026: Fixing the AI Infra Scale Problem by Stuffing 1M Sandboxes in a Single Server
Unikraft CEO Felipe Huici demonstrated waking the one-millionth VM on a commodity server in ten milliseconds at QCon London. The talk traced a decade from academic unikernel research to a platform offering stateless scale-to-zero VMs with full isolation. Using Firecracker and VM snapshots, sleeping workloads resume instantly, turning server density from a hardware problem into a scheduling one.
-
Sonatype Launches Guide to Enhance Safety in AI-Assisted Code Generation
Sonatype Guide is a real-time guardrail system that sits between AI coding tools and the open-source ecosystem, ensuring AI-generated code uses safe, valid, and maintainable dependencies.
-
Harness Reimagines Artifact Management for DevSecOps with New Artifact Registry
Harness has announced the general availability of Harness Artifact Registry, a platform capability designed to simplify how engineering teams store, secure, and govern software artifacts within modern DevSecOps pipelines.
-
QCon London 2026: Kleppmann on Mitigating Europe's Cloud Dependency with Local-First Software
Europe is completely dependent on US cloud services, Martin Kleppmann told QCon London. His fix: commoditise everything. He walked through three technologies he's helped build: multi-cloud via de facto standards, Bluesky's AT Protocol for social media, and local-first software for collaboration, all designed to make switching providers trivial and shift power back to users.
-
QCon London 2026: Morgan Stanley Rethinks Its API Program for the MCP Era
Morgan Stanley engineers Jim Gough and Andreea Niculcea showed how they're retooling the bank's API program for AI agents using MCP and FINOS CALM. Live demos covered compliance guardrails, deployment gates, and zero-downtime rollouts across 100+ APIs. The time to a first API deployment shrank from two years to two weeks. They also demoed Google's A2A protocol running alongside MCP.
-
Microsoft Adds DRA-Backed NVIDIA vGPU Support to AKS
The Azure Kubernetes Service team shared a detailed guide on how to use Dynamic Resource Allocation (DRA) with NVIDIA vGPU technology on AKS. This update improves control and efficiency for shared GPU use in AI and media tasks.