BT

New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

Older rss
46:03

Scaling up Near Real-Time Analytics @Uber &LinkedIn

Posted by Chinmay Soman  on  Mar 30, 2017 Posted by Chinmay Soman Yi Pan  on  Mar 30, 2017

Chinmay Soman and Yi Pan discuss how Uber and LinkedIn use Apache Samza, Calcite and Pinot along with the analytics platform AthenaX to transform data to make it available for querying in minutes.

47:47

Stream Processing & Analytics with Flink @Uber

Posted by Danny Yuan  on  Mar 25, 2017 Posted by Danny Yuan  on  Mar 25, 2017

Danny Yuan discusses how Uber builds its next generation of stream processing system to support real-time analytics as well as complex event processing.

49:31

Data Cleansing and Understanding Best Practices

Posted by Casey Stella  on  Mar 23, 2017 Posted by Casey Stella  on  Mar 23, 2017

Casey Stella talks about discovering missing values, values with skewed distributions and likely errors within data, as well as a novel approach to finding data interconnectedness.

43:06

Elastic Data Analytics Platform @Datadog

Posted by Doug Daniels  on  Feb 17, 2017 1 Posted by Doug Daniels  on  Feb 17, 2017 1

Doug Daniels discusses the cloud-based platform they have built at DataDog and how it differs from a traditional datacenter-based analytics stack, pros and cons and the tooling built.

33:53

Streaming Live Data and the Hadoop Ecosystem

Posted by Oleg Zhurakousky  on  Jan 29, 2017 Posted by Oleg Zhurakousky  on  Jan 29, 2017

Oleg Zhurakousky discusses the Hadoop ecosystem – Hadoop, HDFS, Yarn-, and how projects such as Hive, Atlas, NiFi interact and integrate to support the variety of data used for analytics.

50:21

Scaling Counting Infrastructure @Quora

Posted by Chun-Ho Hung  on  Jan 22, 2017 Posted by Chun-Ho Hung Nikhil Garg  on  Jan 22, 2017

Chun-Ho Hung and Nikhil Garg discuss Quanta, Quora's counting system powering their high-volume near-real-time analytics, describing the architecture, design goals, constraints, and choices made.

43:45

Forecasting Using Data - Quickly Answering How Big, How Long and How Likely

Posted by Troy Magennis  on  Dec 16, 2016 1 Posted by Troy Magennis  on  Dec 16, 2016 1

Troy Magennis explains in this workshop how to capture data and use it for reliable project forecasting using a practical and simple approach to forecasting without item effort estimation.

25:32

Validation Methodology of Large Unstructured Unsupervised Learning Systems

Posted by Lawrence Chernin  on  Nov 26, 2016 Posted by Lawrence Chernin  on  Nov 26, 2016

Lawrence Chernin describes best practices and validation methods used to deal with large unstructured data, including a suite of unit tests covering the implementations of algorithmic equations.

38:53

Developing a Machine Learning Based Predictive Analytics Engine for Big Data Analytics

Posted by Ali Jalali  on  Oct 16, 2016 Posted by Ali Jalali  on  Oct 16, 2016

Ali Jalali presents how to develop a machine learning predictive analytics engine for big data analytics.

30:06

The Joy of Analysis Development

Posted by Hilary Parker  on  Sep 06, 2016 Posted by Hilary Parker  on  Sep 06, 2016

Hilary Parker discusses the history of the analysis development tools, the current state of the art, and the importance for data scientists and analysts to understand programming principles.

38:06

Structuring Data for Self-Serve Customer Insights

Posted by Jim Porzak  on  Aug 12, 2016 Posted by Jim Porzak  on  Aug 12, 2016

Jim Porzak discusses creating an analyst ready data mart that is complete at different levels of abstraction and models customer decision points in order to be able to understand customers.

42:50

Applying Big Data

Posted by Graeme Seaton  on  Aug 07, 2016 Posted by Graeme Seaton  on  Aug 07, 2016

Graeme Seaton discusses the drivers behind Big Data initiatives and how to approach them using the vast amounts of data available.

BT