BT

Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ

Register Sign in

Unlock the full InfoQ experience

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources.

Log In

or

Don't have an InfoQ account?

Stay updated on topics and peers that matter to youReceive instant alerts on the latest insights and trends.
Quickly access free resources for continuous learningMinibooks, videos with transcripts, and training materials.
Save articles and read at anytimeBookmark articles to read whenever youre ready.

Logo - Back to homepage

News Articles Presentations Podcasts Guides

Topics

Development

Featured in Development

Expanding Swift from Apps to Services

Cory Benfield discusses the evolution of Swift from an app language to a critical tool for secure, high-scale services. He explains how Swift’s lack of a garbage collector eliminates tail latency and shares how its "zero-cost abstractions" rival C performance. He shares Apple’s roadmap for incremental adoption and demonstrates groundbreaking new interoperability for C++ and Java ecosystems.

All in development

Architecture & Design

Featured in Architecture & Design

You’ve Generated Your MVP Using AI. What Does That Mean for Your Software Architecture?

AI‑generated code creates implicit architectural decisions, forcing teams to rely on experimentation to validate quality attributes. To get useful results from AI, teams must clearly express trade‑offs and reasoning so the model can generate solutions aligned with desired QARs.

All in architecture-design

AI Infrastructure

Featured in AI, ML & Data Engineering

Building Embedding Models for Large-Scale Real-World Applications

Sahil Dua discusses the critical role of embedding models in powering search and RAG applications at scale. He explains the transformer-based architecture, contrastive learning techniques, and the process of distilling large language models into production-ready student models. He shares insights on optimizing query latency, handling document indexing, and evaluating retrieval quality.

All in ai-ml-data-eng

Culture & Methods

Featured in Culture & Methods

Beyond Code: How Engineers Need to Evolve in the AI Era

In this podcast, Shane Hastie, Lead Editor for Culture & Methods, spoke to Ben Greene about embracing AI in software engineering, expanding beyond pure technical skills to understand business context, and prioritizing human empathy in increasingly automated systems..

All in culture-methods

DevOps

Featured in DevOps

From Alert Fatigue to Agent-Assisted Intelligent Observability

As systems grow, observability becomes harder to maintain and incidents harder to diagnose. Agentic observability layers AI on existing tools, starting in read-only mode to detect anomalies and summarize issues. Over time, agents add context, correlate signals, and automate low-risk tasks. This approach frees engineers to focus on analysis and judgment.

All in devops

Events

Helpful links

Choose your language

InfoQ Architect Certification

Join Luca Mezzalira for this 5-week online cohort. Master socio-technical architecture leadership.

Register Interest.

QCon London 2026

Learn what works in AI, architecture, data, security & FinTech.

Early Bird ends March 10.

Learn how leading engineering teams run AI in production—reliably, securely, and at scale.

Early Bird ends March 10.

QCon San Francisco

Learn what's next in AI and software, from teams already doing it.

Early Bird ends March 10.

InfoQ Homepage Failure Content on InfoQ

Failure

RSS Feed

Podcasts about Failure

RSS Feed

Architecture & Design

Architecture Should Model the World as it Really is: a Conversation with Randy Shoup

Randy Shoup
on Nov 10, 2025

Icon

51:15
Architecture & Design

Oliver Gould Discusses Architecting to Avoid and Recover from Failure

Oliver Gould
on Jan 01, 2017

Icon

33:24
Architecture & Design

Haley Tucker on Responding to Failures in Playback Features at Netflix

Haley Tucker
on Dec 09, 2016

Icon

33:20
Architecture & Design

Uber's Chief Systems Architect on their Architecture and Rapid Growth

Matt Ranney
on May 13, 2016

Icon

31:28

Articles about Failure

RSS Feed

Cloud

Designing Resilient Event-Driven Systems at Scale

Rajesh Kumar Pandey
on May 30, 2025
Cloud

Can We Trust the Cloud Not to Fail?

Lena Hall
on May 11, 2021
Culture & Methods

Q&A on the Book Fail to Learn

Ben Linders Scott Provence
on Sep 21, 2020
DevOps

Failover Conf Q&A on Building Reliable Systems: People, Process, and Practice

Angel Rivera Tiffany Jachja Heidi Waterhouse Jim Walker Dave Nielsen Laura Hofmann
on Apr 20, 2020
DevOps

An Engineer’s Guide to a Good Night’s Sleep

Nicky Wrightson
on Aug 20, 2019
Culture & Methods

The New Killer Apps: Teamwork and Weak Signal Detection Lessons from the Military

Brian Rivera
on Nov 13, 2018
Architecture & Design

Resilient Systems in Banking

Greg Hawkins
on Oct 06, 2018
Culture & Methods

Soft Skill Patterns for Software Developers: The “Learning from Unintended Failures” Pattern

Kevin Jackson
on Dec 05, 2017
Culture & Methods

Q&A with Ash Maurya on Scaling Lean

Ben Linders
on Apr 26, 2017
Culture & Methods

Adaptable or Predictable? Strive for Both – Be Predictably Adaptable!

Dimitar Bakardzhiev
on Sep 06, 2016
Culture & Methods

Q&A with Diomidis Spinellis on Effective Debugging

Ben Linders
on Aug 24, 2016

News about Failure

RSS Feed

Culture & Methods

Applying DevOps Principles and Practices as a Quality Assurance Engineer

Ben Linders
on Mar 20, 2025
Culture & Methods

How to Improve Software Team Performance with Experimentation

Ben Linders
on Oct 03, 2024
Culture & Methods

A Distributed System is Knowable: an Impossible Thing for Developers

Ben Linders
on Sep 01, 2022
Architecture & Design

Dealing with Thundering Herd at Braintree

Sergio De Simone
on May 19, 2022
Cloud

Microsoft Announces Azure Chaos Studio in Public Preview

Steef-Jan Wiggers
on Nov 10, 2021
Culture & Methods

How a Safe-to-Fail Approach Can Enable Psychological Safety in Teams

Ben Linders
on Oct 14, 2021
DevOps

AWS Announces Chaos Engineering as a Service Offering

Matt Campbell
on Dec 21, 2020
Java

New LiveRecorder for Java Enables Software Failure Replay

Johan Janssen
on Aug 31, 2020
DevOps

Cloudflare’s 27 Minutes Outage Explained

Aditya Kulkarni
on Aug 29, 2020
DevOps

Failure Modes and Building Resilient Systems: Adrian Cockcroft at QCon SF

Matt Campbell
on Dec 18, 2019
DevOps

How Did Things Go Right? Learning More from Incidents at Netflix: Ryan Kitchens at QCon New York

Daniel Bryant
on Jul 05, 2019

Presentations about Failure

RSS Feed

Architecture & Design

The Art of Embracing Failures with Serverless Architectures

Anahit Pogosova
on Feb 19, 2025

Icon

50:45
Culture & Methods

Risk and Failure on the Path to Staff Engineer

Caleb Hyde
on Jul 10, 2024

Icon

46:41
Architecture & Design

Deconstructing an Abstraction to Reconstruct an Outage

Chris Sinjakli
on Dec 22, 2023

Icon

41:19
DevOps

How Did It Make Sense at the Time? Understanding Incidents as They Occurred, Not as They are Remembered

Jacob Scott
on Sep 14, 2023

Icon

38:15
Architecture & Design

Managing the Risk of Cascading Failure

Laura Nolan
on Jul 11, 2021

Icon

40:19
DevOps

Culturing Resiliency with Data: a Taxonomy of Outages

Ranjib Dey
on Dec 25, 2020

Icon

29:14
DevOps

Failing over without Falling over

Adrian Cockcroft
on Nov 20, 2020

Icon

21:34
Development

#FAIL

Kevlin Henney
on Nov 10, 2019

Icon

42:37
Culture & Methods

Rules in Agile Transformation: 80/20 and “Not Everybody Likes to Dance”

Zbigniew Piecuch
on Nov 01, 2019

Icon

31:39
DevOps

What Breaks Our Systems: A Taxonomy of Black Swans

Laura Nolan
on Oct 10, 2019

Icon

50:46

MORE PRESENTATIONS

BT