BT
Older rss
25:30
Data Science Follow 457 Followers

Bias in BigData/AI and ML

Posted by Leslie Miley  on  Dec 23, 2017 1 Posted by Leslie Miley Follow 0 Followers  on  Dec 23, 2017 1

Leslie Miley discusses how inherent bias in data sets has affected things from the 2016 Presidential race to criminal sentencing in the United States.

46:58
Data Science Follow 457 Followers

Scaling with Apache Spark

Posted by Holden Karau  on  Aug 05, 2017 Posted by Holden Karau Follow 3 Followers  on  Aug 05, 2017

Holden Karau looks at Apache Spark from a performance/scaling point of view and what’s needed to handle large datasets.

50:45
Architecture & Design Follow 1133 Followers

Serverless Design Patterns with AWS Lambda: Big Data with Little Effort

Posted by Tim Wagner  on  Jul 29, 2017 Posted by Tim Wagner Follow 2 Followers  on  Jul 29, 2017

Tim Wagner discusses Big Data on serverless, showing working examples and how to set up a CI/CD pipeline, demonstrating AWS Lambda with the Serverless Application Model (SAM).

54:50
Data Science Follow 457 Followers

Scio: Moving Big Data to Google Cloud, a Spotify Story

Posted by Neville Li  on  May 26, 2017 Posted by Neville Li Follow  Followers  on  May 26, 2017

Neville Li tells the Spotify’s story of migrating their big data infrastructure to Google Cloud, replacing Hive and Scalding with BigQuery and Scio, which helped them iterate faster.

45:00
Data Science Follow 457 Followers

Data Preparation for Data Science: A Field Guide

Posted by Casey Stella  on  Apr 23, 2017 Posted by Casey Stella Follow 0 Followers  on  Apr 23, 2017

Casey Stella presents a utility written with Apache Spark to automate data preparation, discovering missing values, values with skewed distributions and discovering likely errors within data.

42:48
Data Science Follow 457 Followers

AI from an Investment Perspective

Posted by Sanjit Dang  on  Apr 18, 2017 Posted by Sanjit Dang Follow 0 Followers , Kiersten Stead Follow 0 Followers , Yashwanth Hemaraj Follow 0 Followers , Pankaj Mitra Follow 0 Followers , Leonard Speiser Follow 0 Followers , Kartik Gada Follow 0 Followers , Doug Dooley Follow 0 Followers  on  Apr 18, 2017

The panelists discuss AI from an investment perspective, the challenges, the risks, trends, the role of Deep Learning, successful AI use cases, and more.

50:48
Data Science Follow 457 Followers

Big Data Infrastructure @ LinkedIn

Posted by Shirshanka Das  on  Apr 02, 2017 Posted by Shirshanka Das Follow 0 Followers  on  Apr 02, 2017

Shirshanka Das describes LinkedIn’s Big Data Infrastructure and its evolution through the years, including details on the motivation and architecture of Gobblin, Pinot and WhereHows.

47:03
Data Science Follow 457 Followers

Real-Time Recommendations Using Spark Streaming

Posted by Elliot Chow  on  Mar 30, 2017 Posted by Elliot Chow Follow 0 Followers  on  Mar 30, 2017

Elliot Chow discusses the data pipeline that they built with Kafka, Spark Streaming, and Cassandra to process Netflix user activities in real time for the Trending Now row.

49:06
Data Science Follow 457 Followers

Building a Data Science Capability from Scratch

Posted by Victor Hu  on  Mar 23, 2017 Posted by Victor Hu Follow 0 Followers  on  Mar 23, 2017

Victor Hu covers the challenges, both technical and cultural, of building a data science team and capability in a large, global company.

40:48
Data Science Follow 457 Followers

Data Science in the Cloud @StitchFix

Posted by Stefan Krawczyk  on  Feb 17, 2017 Posted by Stefan Krawczyk Follow 0 Followers  on  Feb 17, 2017

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

45:26
DevOps Follow 416 Followers

Petabytes Scale Analytics Infrastructure @Netflix

Posted by Tom Gianos  on  Feb 15, 2017 Posted by Tom Gianos Follow 0 Followers , Dan Weeks Follow 0 Followers  on  Feb 15, 2017

Tom Gianos and Dan Weeks discuss Netflix' overall big data platform architecture, focusing on Storage and Orchestration, and how they use Parquet on AWS S3 as their data warehouse storage layer.

01:02:53
Data Science Follow 457 Followers

Big Data in the Real World: Technology and Use Cases

Posted by Mike Olson  on  Feb 09, 2017 Posted by Mike Olson Follow 0 Followers  on  Feb 09, 2017

Mike Olson presents several use cases where big data is collected and analyzed to gather insights from the automotive, insurance, financial, and other sectors.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT