Robert Benefield offers a pragmatic overview for discovering operational indicators that provide valuable insight in running and improving online services.
Paul Stack discusses using PowerShell and Puppet to administer Windows machines, showing how to configure a Windows server and set up a development environment in short time.
Ryan Vanderwerf explains how to create and deploy a Grails application on AWS VPC using various services such as RDS, S3, autoscaling, S3FS, EBS, etc.
Pedro Canahuati describes how Facebook's operations maintains their infrastructure, including challenges faced and lessons learned: prioritizing calls, managing technical debt, incident management.
Ben Christensen describes Netflix API's evolution to a web service platform serving all devices and users, the challenges met in operations, deployment, performance, fault-tolerance, and innovation.
Peter Niederwieser discusses building a continuous delivery pipeline with Gradle and Jenkins.
Joe Sondow presents how Netflix uses Asgard to deploy code updates and manage resources in the Amazon cloud.
Antoni Batchelli introduces VMFest, a PalletOps project used to turn VirtualBox into a lightweight cloud provider, good for developing cloud automation.
Antoni Batchelli discusses building an automated infrastructure in Clojure.
Raffi Krikorian explains the architecture used by Twitter to deal with thousands of events per sec - tweets, social graph mutations, and direct messages-.
Jesse Robbins explains how to evangelize & overcome cultural resistance to change while sharing his own painfully funny lessons on how not to do it.
Roy Rapoport discusses how Netflix uses metrics to monitor and manage their operating environment along with some notes about their event management system.