Jonathan Gray introduces Hydrator, an open source framework and user interface for creating data lakes and for building and managing data pipelines on Spark, MapReduce, Spark Streaming and Tigon.
Ali Jalali presents how to develop a machine learning predictive analytics engine for big data analytics.
Amar Sharma talks about NBC Universal's microservices infrastructure and how their new approach to software development has equipped them to make better decisions about product investment.
John Billings talks about winning over those skeptical about the benefits of microservices, along with tips on caching, failure, and interface changes for building a distributed system architecture.
Casey West talks about anti-patterns and corresponding best practices based on his experience building application infrastructure and platforms, as well as the applications deployed to them.
Sameer Farooqui demos connecting to the live stream of Wikipedia edits, building a dashboard showing what’s happening with Wikipedia datasets and how people are using them in real time.
Graeme Seaton discusses the drivers behind Big Data initiatives and how to approach them using the vast amounts of data available.
Andrew Psaltis talks about Apache Beam, which aims to provide a unified stream processing model for defining and executing complex data processing, data ingestion and integration workflows.
Saul Caganoff discusses the different use cases for API consumption and the technical affordances API designers can provide to support those use cases.
Chien Huey evaluates Marathon running on DC/OS as a replacement for Elastic Beanstalk and/or ECS in terms of functionality, ease of use, and cost.
Kief Morris discusses building and maintaining a testing and hosting infrastructure for microservices, explaining the creation of a cloud-based infrastructure with Packer, Terraform, and Ansible.
Peter Bourgon and Matthias Radestock explain the theory behind Weave Mesh, cover some of its key features, and demonstrate use cases such as distributed caching and state replication.