Pedro Canahuati describes how Facebook's operations maintains their infrastructure, including challenges faced and lessons learned: prioritizing calls, managing technical debt, incident management.
Ben Christensen describes Netflix API's evolution to a web service platform serving all devices and users, the challenges met in operations, deployment, performance, fault-tolerance, and innovation.
Peter Niederwieser discusses building a continuous delivery pipeline with Gradle and Jenkins.
Joe Sondow presents how Netflix uses Asgard to deploy code updates and manage resources in the Amazon cloud.
Antoni Batchelli introduces VMFest, a PalletOps project used to turn VirtualBox into a lightweight cloud provider, good for developing cloud automation.
Antoni Batchelli discusses building an automated infrastructure in Clojure.
Raffi Krikorian explains the architecture used by Twitter to deal with thousands of events per sec - tweets, social graph mutations, and direct messages-.
Jesse Robbins explains how to evangelize & overcome cultural resistance to change while sharing his own painfully funny lessons on how not to do it.
Roy Rapoport discusses how Netflix uses metrics to monitor and manage their operating environment along with some notes about their event management system.
Kathleen Ting details 8 misconfigurations that can bring ZooKeeper down.
Filippos Santas explains how to apply service-orientation principles, patterns, processes and SOA governance precepts to ITIL's service lifecycle stages, key processes and activities.
Koa McCullough presents best practices for running Percona Server and MySQL in the cloud, cloud backups using EBS, Xtrabackup and S3, using Percona Toolkit to simplify operations, and XtraDB Cluster.