45:26

Petabytes Scale Analytics Infrastructure @Netflix

Posted by Tom Gianos, Dan Weeks on Feb 15, 2017

Tom Gianos and Dan Weeks discuss Netflix's overall big data platform architecture, focusing on storage and orchestration, and how they use Parquet on AWS S3 as their data warehouse storage layer.

01:02:53

Big Data in the Real World: Technology and Use Cases

Posted by Mike Olson on Feb 09, 2017

Mike Olson presents several use cases where big data is collected and analyzed to gather insights from the automotive, insurance, financial, and other sectors.

38:49

Using Bayesian Optimization to Tune Machine Learning Models

Posted by Scott Clark on Feb 07, 2017

Scott Clark introduces Bayesian Global Optimization as an efficient way to optimize ML model parameters, explaining the underlying techniques and comparing it to other standard methods.

32:49

Machine Learning and End-to-End Data Analysis Processes in Spark Using Python and R

Posted by Debraj GuhaThakurta on Feb 05, 2017

Debraj GuhaThakurta discusses ML and data analysis processes in Spark using examples written in Python and R.

50:44

Java (SE) State of the Union

Posted by Gil Tene on Jan 17, 2017

Gil Tene presents the current state of Java SE and OpenJDK, the role of Java in big data and infrastructure components, the JCP, the ecosystem, and current trends.

01:10:46

Cloud Native Streaming and Event-driven Microservices

Posted by Marius Bogoevici on Jan 14, 2017

Marius Bogoevici demonstrates how to create complex data processing pipelines that bridge big data and enterprise integration, and how to orchestrate them with Spring Cloud Data Flow.

55:24

Spring and Big Data

Posted by Thomas Risberg on Jan 08, 2017

Thomas Risberg discusses developing big data pipelines with Spring, focusing on the code needed, and covers how to set up a test environment both locally and in the cloud.

21:27

Uses of Big Data by a Non-Profit Engaged in Conducting Events Funded in Part by Third Party Sponsors

Posted by Thomas Grilk on Dec 28, 2016

Thomas Grilk discusses how a non-profit can efficiently use data from customers/athletes in its marketing and sponsorship activities while respecting the privacy and confidentiality of its customers.

31:03

TensorFlow: A Flexible, Scalable & Portable System

Posted by Rajat Monga on Dec 17, 2016

Rajat Monga talks about why Google built TensorFlow, an open source software library for numerical computation using data flow graphs, and some of the technical challenges in building it.

50:31

Visual Rules of the Road for Big Data Practitioners

Posted by David Fisher on Dec 16, 2016

David Fisher shows by example how to build a data navigation language into visualizations, providing an intuitive user experience through subtle visual cues.

25:32

Validation Methodology of Large Unstructured Unsupervised Learning Systems

Posted by Lawrence Chernin on Nov 26, 2016

Lawrence Chernin describes best practices and validation methods used to deal with large unstructured data, including a suite of unit tests covering the implementations of algorithmic equations.

34:47

How Predictive Analytics Boosts the Customer Experience at the Georgia Aquarium

Posted by Beach Clark on Nov 20, 2016

Beach Clark talks about the technological and cultural challenges of turning data science into a vital part of the business model at Georgia Aquarium.
