BT
Older Newer rss
40:10

Subside Failure: Partitioning Time and Space

Posted by Sadek Drobi  on  May 21, 2016 Posted by Sadek Drobi  on  May 21, 2016

Sadek Drobi talks about the prismic.io API and how to understand the properties and the mechanics of a system, and to partition its different dimensions to avoid a domino style failure cascade.

35:48

The Myth of the Qualified Developer

Posted by Pete Smith  on  Apr 28, 2016 Posted by Pete Smith  on  Apr 28, 2016

Pete Smith shares from his experience, discussing what it means to fail and how to make the most of it

37:34

Microservices Antipatterns

Posted by Tammer Saleh  on  Apr 15, 2016 1 Posted by Tammer Saleh  on  Apr 15, 2016 1

Tammer Saleh talks about the mistakes made building microservices, when microservices are appropriate, where to draw the lines between services, performance issues, testing, debugging, failure, etc.

38:48

Architecting Distributed Databases for Failure

Posted by Fangjin Yang  on  Feb 27, 2016 Posted by Fangjin Yang  on  Feb 27, 2016

Fangjin Yang covers common problems and failures seen with distributed systems, and discusses design patterns that can be used to maintain data integrity and availability when everything goes wrong.

45:48

Redefining Failure at Yammer

Posted by Nate Fink  on  Feb 13, 2016 Posted by Nate Fink  on  Feb 13, 2016

Nate Fink shares how Yammer has changed everything from how they structure teams to the role of managers to how they measure progress so they can not only survive but thrive learning.

49:40

Partial Failures in a Microservices Jungle: Survival Tips from Comcast

Posted by Jon Moore  on  Sep 27, 2015 Posted by Jon Moore  on  Sep 27, 2015

Jon Moore goes over some strategies for surviving in a jungle of partial failures. Each survival tip is explained through a concrete example, or "adventure story", from Comcast’s TV experience.

28:27

How to Steal Your Competitors' Talent

Posted by Matt Buckland  on  Sep 25, 2015 Posted by Matt Buckland  on  Sep 25, 2015

Matt Buckland discusses some of the cultures he has encountered in his work experience, the success stories and the failures, outlining what makes a great organizational culture.

36:01

Circuit Breaking in Distributed Systems

Posted by Matt Heath  on  Sep 11, 2015 Posted by Matt Heath  on  Sep 11, 2015

Matt Heath discusses how circuit breakers and other similar patterns can be used to increase reliability in distributed systems such as Go-based microservice platforms.

48:04

Fail Better: Radical Ideas from the Practice of Cloud Computing

Posted by Tom Limoncelli  on  Sep 03, 2015 2 Posted by Tom Limoncelli  on  Sep 03, 2015 2

Tom Limoncelli discusses creating resiliency at the most economic level, doing risky procedures often, and creating a blameless culture to encourage communication and improve system reliability.

42:37

Breaking Bad at Netflix: Building Failure as a Service

Posted by Kolton Andrus  on  Aug 27, 2015 Posted by Kolton Andrus  on  Aug 27, 2015

Kolton Andrus presents how Netflix, in order to harden their systems, designed “Failure as a Service” to allow anyone to test and validate how their systems handle failure.

42:31

Resilience, Service Discovery and Zero Downtime Deployment in Microservice Architectures

Posted by York Xyander  on  Jul 31, 2015 Posted by York Xyander Bodo Junglas  on  Jul 31, 2015

York Xyander, Bodo Junglas discuss strategies for service discoverability and transparent failover in a microservices architecture, how to achieve zero downtime and an auto-scaling architecture.

43:49

Managing Internal and External Services for a High Uptime Goal

Posted by Shobana Radhakrishnan  on  Apr 05, 2015 Posted by Shobana Radhakrishnan  on  Apr 05, 2015

Shobana Radhakrishnan shares details about best practices adopted in implementing API integration with third party services, how to manage change and deal with failures.

BT