BT

InfoQ Homepage On-call Content on InfoQ

Articles

RSS Feed
  • Sustainable Operations in Complex Systems with Production Excellence

    Successful long-term approaches to production ownership and DevOps require cultural change in the form of production excellence. Teams are more sustainable if they have well-defined measurements of reliability, the capability to debug new problems, a culture that fosters spreading knowledge, and a proactive approach to mitigating risk.

  • Observability-Driven Development for Tackling the Great Unknown

    How does observability-driven development differ from monitoring? As our distributed systems become increasingly more complicated and as our silos break down for DevOps testing, automation, and efficiency, ODD arises as a superset of monitoring to understand your code’s unknown unknowns. Includes insights from Honeycomb Founder Charity Majors.

  • Book Review: Site Reliability Engineering - How Google Runs Production Systems

    "Site Reliability Engineering - How Google Runs Production Systems" is an open window into Google's experience and expertise on running some of the largest IT systems in the world. The book describes the principles that underpin the Site Reliability Engineering discipline. It also details the key practices that allow Google to grow at breakneck speed without sacrificing performance or reliability.

BT

Is your profile up-to-date? Please take a moment to review and update.

Note: If updating/changing your email, a validation request will be sent

Company name:
Company role:
Company size:
Country/Zone:
State/Province/Region:
You will be sent an email to validate the new email address. This pop-up will close itself in a few moments.