BT

New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

Older rss
46:58
Data Science Follow 278 Followers

Scaling with Apache Spark

Posted by Holden Karau  on  Aug 05, 2017 Posted by Holden Karau Follow 3 Followers  on  Aug 05, 2017

Holden Karau looks at Apache Spark from a performance/scaling point of view and what’s needed to handle large datasets.

50:45
Architecture & Design Follow 620 Followers

Serverless Design Patterns with AWS Lambda: Big Data with Little Effort

Posted by Tim Wagner  on  Jul 29, 2017 Posted by Tim Wagner Follow 2 Followers  on  Jul 29, 2017

Tim Wagner discusses Big Data on serverless, showing working examples and how to set up a CI/CD pipeline, demonstrating AWS Lambda with the Serverless Application Model (SAM).

54:50
Data Science Follow 278 Followers

Scio: Moving Big Data to Google Cloud, a Spotify Story

Posted by Neville Li  on  May 26, 2017 Posted by Neville Li Follow  Followers  on  May 26, 2017

Neville Li tells the Spotify’s story of migrating their big data infrastructure to Google Cloud, replacing Hive and Scalding with BigQuery and Scio, which helped them iterate faster.

45:00
Data Science Follow 278 Followers

Data Preparation for Data Science: A Field Guide

Posted by Casey Stella  on  Apr 23, 2017 Posted by Casey Stella Follow 0 Followers  on  Apr 23, 2017

Casey Stella presents a utility written with Apache Spark to automate data preparation, discovering missing values, values with skewed distributions and discovering likely errors within data.

42:48
Data Science Follow 278 Followers

AI from an Investment Perspective

Posted by Sanjit Dang  on  Apr 18, 2017 Posted by Sanjit Dang Follow 0 Followers , Kiersten Stead Follow 0 Followers , Yashwanth Hemaraj Follow 0 Followers , Pankaj Mitra Follow 0 Followers , Leonard Speiser Follow 0 Followers , Kartik Gada Follow 0 Followers , Doug Dooley Follow 0 Followers  on  Apr 18, 2017

The panelists discuss AI from an investment perspective, the challenges, the risks, trends, the role of Deep Learning, successful AI use cases, and more.

50:48
Data Science Follow 278 Followers

Big Data Infrastructure @ LinkedIn

Posted by Shirshanka Das  on  Apr 02, 2017 Posted by Shirshanka Das Follow 0 Followers  on  Apr 02, 2017

Shirshanka Das describes LinkedIn’s Big Data Infrastructure and its evolution through the years, including details on the motivation and architecture of Gobblin, Pinot and WhereHows.

47:03
Data Science Follow 278 Followers

Real-Time Recommendations Using Spark Streaming

Posted by Elliot Chow  on  Mar 30, 2017 Posted by Elliot Chow Follow 0 Followers  on  Mar 30, 2017

Elliot Chow discusses the data pipeline that they built with Kafka, Spark Streaming, and Cassandra to process Netflix user activities in real time for the Trending Now row.

49:06
Data Science Follow 278 Followers

Building a Data Science Capability from Scratch

Posted by Victor Hu  on  Mar 23, 2017 Posted by Victor Hu Follow 0 Followers  on  Mar 23, 2017

Victor Hu covers the challenges, both technical and cultural, of building a data science team and capability in a large, global company.

40:48
Data Science Follow 278 Followers

Data Science in the Cloud @StitchFix

Posted by Stefan Krawczyk  on  Feb 17, 2017 Posted by Stefan Krawczyk Follow 0 Followers  on  Feb 17, 2017

Stefan Krawczyk discusses how StitchFix used the cloud to enable over 80 data scientists to be productive and have easy access, covering prototyping, algorithms used, keeping schema in sync, etc.

45:26
DevOps Follow 235 Followers

Petabytes Scale Analytics Infrastructure @Netflix

Posted by Tom Gianos  on  Feb 15, 2017 Posted by Tom Gianos Follow 0 Followers , Dan Weeks Follow 0 Followers  on  Feb 15, 2017

Tom Gianos and Dan Weeks discuss Netflix' overall big data platform architecture, focusing on Storage and Orchestration, and how they use Parquet on AWS S3 as their data warehouse storage layer.

01:02:53
Data Science Follow 278 Followers

Big Data in the Real World: Technology and Use Cases

Posted by Mike Olson  on  Feb 09, 2017 Posted by Mike Olson Follow 0 Followers  on  Feb 09, 2017

Mike Olson presents several use cases where big data is collected and analyzed to gather insights from the automotive, insurance, financial, and other sectors.

38:49
Data Science Follow 278 Followers

Using Bayesian Optimization to Tune Machine Learning Models

Posted by Scott Clark  on  Feb 07, 2017 Posted by Scott Clark Follow 0 Followers  on  Feb 07, 2017

Scott Clark introduces Bayesian Global Optimization as an efficient way to optimize ML model parameters, explaining the underlying techniques and comparing it to other standard methods.

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT