Pedro Canahuati describes how Facebook's operations maintains their infrastructure, including challenges faced and lessons learned: prioritizing calls, managing technical debt, incident management.
Ben Christensen describes how the Netflix API evolved from a typical one-size-fits-all RESTful API designed to support public developers into a web service platform optimized to handle the diversity and variability of each device and user experience. The talk will also address the challenges involving operations, deployment, performance, fault-tolerance, and rate of innovation at massive scale.
Joe Sondow presents how Netflix uses Asgard to deploy code updates and manage resources in the Amazon cloud.
Roy Rapoport discusses how Netflix uses metrics to monitor and manage their operating environment along with some notes about their event management system.
Filippos Santas explains how to apply service-orientation principles, patterns, processes and SOA governance precepts to ITIL's service lifecycle stages, key processes and activities.
Phil Toland discusses using Erlang and Ruby providing backup for 20k network devices running in 8 datacenters across 3 continents for Rackspace’s operations.
Ram C Singh discusses using Big Data for infrastructure telemetry along with good practices and an autonomic engine to create an autonomic computing infrastructure that might prevent downtime.
Jez Humble discusses how to deal with risk management, regulation compliance, ITIL, audit requirements in a large organization that intends to adopt devops.
Julian Simpson recommends practices to bring together development and operations, like: collaboration, don’t use email for internal communication, respect everyone, have lunch with the other team, discuss problems, joined retrospectives and stand-up meetings, co-locate sysadmins and developers, teach sysadmin to use VC, use CI and continues deployment, separate binaries and configuration files.