InfoQ Homepage DevOps Content on InfoQ

Presentations

RSS Feed

Newer Older

Architecture & Design

Enhancing Reliability Using Service-Level Prioritized Load Shedding at Netflix

Anirudh Mendiratta and Benjamin Fedorka explain how Netflix handles massive traffic storms using service-level prioritized load shedding and client-side attempt budgets to protect critical path APIs.

Anirudh Mendiratta Benjamin Fedorka
on Jul 02, 2026

Icon

50:41
DevOps

The Time it Wasn't DNS

Sean Klein explains how Azure handles massive outages using modern incident analysis. Moving past the "Five Whys," he shares how systemic factors—not operator error—caused the 2023 global WAN outage.

Sean Klein
on Jun 23, 2026

Icon

43:59
AI, ML & Data Engineering

Write-Ahead Intent Log: a Foundation for Efficient CDC at Scale

Vinay Chella and Akshat Goel explain why they outgrew traditional CDC at scale. They share how they built Write-Ahead Intent Log (WAIL) using a proxy layer to decouple data replication.

Vinay Chella Akshat Goel
on Jun 18, 2026

Icon

51:26
AI, ML & Data Engineering

Automating the Web with MCP: Infra that Doesn’t Break

Paul Klein explains how to automate the web with MCP. He shares architectural strategies for running multi-tenant, cloud-hosted Chromium sandboxes to power AI browsing agents.

Paul Klein
on Jun 16, 2026

Icon

53:42
DevOps

Confidently Automating Changes across a Diverse Fleet

Netflix engineer Casey Bleifer explains how the company is automating fleet-wide code changes and migrations at scale, driving adoption timelines down from months to mere days with confidence.

Casey Bleifer
on Jun 09, 2026

Icon

39:28
Cloud

Mitigating Geopolitical Risks with Local-First Software and atproto

Martin Kleppmann explains how de facto standards, multi-cloud strategies, the AT Protocol, and local-first software can mitigate geopolitical risks and eliminate vendor lock-in.

Martin Kleppmann
on Jun 08, 2026

Icon

50:16
Architecture & Design

Architecting a Centralized Platform for Data Deletion at Netflix

Netflix Engineers Vidhya Arvind and Shawn Liu discuss the pillars of safe, large-scale data deletion. They explain strategies to eliminate data ghosts and manage tombstone resource contention.

Vidhya Arvind Shawn Liu
on Jun 04, 2026

Icon

49:41
DevOps

The Human Toll of Incidents & Ways to Mitigate it

Kyle Lexmond discusses the human side of major system failures. He shares psychological insights and architectural tactics for surviving high-pressure incident rooms.

Kyle Lexmond
on Jun 02, 2026

Icon

51:40
DevOps

Realtime and Batch Processing of GPU Workloads

Joseph Stein explains how to build a highly available private AI cloud. He shares blueprints on scaling vLLM on enterprise GPUs, implementing gateway guardrails, and optimizing batch workloads.

Joseph Stein
on May 26, 2026

Icon

38:24
Culture & Methods

From Legacy to Sovereignty: Driving the Future of Insurance through Platform Engineering

Sergiu Petean explains how to navigate the evolution from legacy systems to platform engineering. He shares data-driven insights on balancing DORA metrics, compliance, and strategic AI integration.

Sergiu Petean
on May 25, 2026

Icon

51:40
DevOps

The Ironies of A^2 I^2

J. Paul Reed explains the "ironies of automation" and AI in incident response. He discusses how reliance on AI can erode manual skills and camouflage system failures during high-stakes outages.

J. Paul Reed
on May 21, 2026

Icon

45:16
DevOps

Powering the Future: Building Your GenAI Infrastructure Stack

Merrin Kurian discusses Intuit’s GenOS, a generative AI operating system powering agents for 100M users. She explains the transition from chat assistants to "done-for-you" autonomous experiences.

Merrin Kurian
on May 19, 2026

Icon

50:39

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations