Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.
Oleg Zhurakousky discusses the Hadoop ecosystem – Hadoop, HDFS, Yarn-, and how projects such as Hive, Atlas, NiFi interact and integrate to support the variety of data used for analytics.
Micahel Klishin talks about how one can troubleshoot a distributed service-oriented system, focusing on Java, Spring, and RabbitMQ.
Adrian Cole overviews how to debug latency problems using call graphs created by Zipkin, taking a look at the ecosystem, including tools to trace Ruby, C#, Java and Spring Boot apps.
Sean T. Allen talks about creating repeatable tests using programmatic fault injection, message tracing, and auditing to create a trustworthy system which provides correct results.
Ian Fyfe discusses the different options for implementing speed-of-thought business analytics and machine learning tools directly on top of Hadoop.
Justin Smith discusses credential hygiene in distributed systems, covering topics such as key encrypting keys, hardware security modules, and promising advances in muti-party computation.
John Billings talks about winning over those skeptical about the benefits of microservices along with tips on caching, failure, interface changes, etc. for building a distributed system architecture.
Aysylu Greenberg revisits some features of modern distributed systems and shows three architectural patterns, their application, and reference papers that are relevant to today's distributed systems.
Caitie McCaffrey discusses the strategies for proving a correct system and less strenuous methods of testing, which can help increase our confidence that a system is doing the right thing.
Colin Humphreys talks about how the Cloud Foundry community deploys the distributed system powering the one of the most successful open source platforms, along with details on the tooling used.
Sean Cribbs discusses practical applications of academic research with a large scale distributed system, as well as membership and dissemination protocols and their application in practice.