40:48

Data Science in the Cloud @StitchFix

Posted by Stefan Krawczyk on Feb 17, 2017

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

44:06

Scaling the Data Infrastructure @Spotify

Posted by Mārtiņš Kalvāns, Matti Pehrs on Jan 28, 2017

Mārtiņš Kalvāns and Matti Pehrs give an overview of the data infrastructure at Spotify, diving into some of its components, such as Event Delivery, Datamon and Styx.

01:07:25

Data Microservices in the Cloud

Posted by Mark Pollack on Jan 08, 2017

Mark Pollack introduces Spring Cloud Data Flow, which enables one to create pipelines for data ingestion, real-time analytics and data import/export, and demos apps deployed onto multiple runtimes.

39:14

Targeting Your Audience: Data Visualization to Communicate Data Insights

Posted by Randy Krum on Dec 16, 2016

Randy Krum explains how to use the power of data visualization to convey actionable insights to an audience, making data clear and memorable by showing the audience what the data means.

50:44

Ingest & Stream Processing - What Will You Choose?

Posted by Pat Patterson, Ted Malaska on Aug 14, 2016

Pat Patterson and Ted Malaska talk about current and emerging data processing technologies, and the various ways of achieving "at least once" and "exactly once" timely data processing.

30:55

Journey from Data Integration to Data Science

Posted by Michael Wise on Aug 11, 2016

Michael Wise discusses the journey from having data integrated across an organization, to employing data science to make good use of it.

01:26:12

Data Driven Action: A Primer on Data Science

Posted by Sarah Aerni, Srivatsan Ramanujam, Jarrod Vawdrey on Dec 13, 2015

Sarah Aerni, Srivatsan Ramanujam and Jarrod Vawdrey present approaches and open source tools for wrangling and modeling massive datasets, scaling Java applications for NLP on MPP through PL/Java and much more.

50:40

Data Structure Adventures

Posted by Joseph Blomstedt on Oct 03, 2015

Joseph Blomstedt presents ongoing work to build a new set of high performance data structures for Erlang, including both single process data structures as well as various concurrent data structures.

59:25

Making Distributed Data Persistent Services Elastic (Without Losing All Your Data)

Posted by Joe Stein on Sep 20, 2015

Joe Stein introduces Mesos and managing data services on it, presenting use cases for replacing classic solutions (like cold storage) with new functionality based on these technologies.

53:01

Design vs. Data: Enemies or Friends?

Posted by Mary Poppendieck on Aug 22, 2015

Big Design Upfront was considered so evil in the early days of Agile that it acquired its own acronym. It’s time we relearned that great products start with asking the right questions.

39:31

Responding Rapidly When You Have 100GB+ Data Sets in Java

Posted by Peter Lawrey on Jul 05, 2015

Peter Lawrey discusses data-driven reactive systems, profiling latency distribution in such an environment, finding rare bugs, implementing resilience and monitoring.

47:40

Evolving a Data System

Posted by Simon Metson on Apr 28, 2015

Simon Metson approaches the problem of evolving a data system, covering patterns and anti-patterns, both technical (polyglot systems, lambda architectures) and organisational (data silos, lava layers).
