BT
Older Newer rss
45:01

The Next Wave of SQL-on-Hadoop: The Hadoop Data Warehouse

Posted by Marcel Kornacker  on  Jul 09, 2014

Marcel Kornacker presents a case study of an EDW built on Impala running on 45 nodes, reducing processing time from hours to seconds and consolidating multiple data sets into one single view.

53:16

Finding the Needle in a Big Data Haystack

Posted by Eva Andreasson  on  Jul 08, 2014

In this solutions track talk, sponsored by Cloudera, Eva Andreasson discusses how search and Hadoop can help with some of the industry's biggest challenges. She introduces the data hub concept.

44:43

A Big Data Arsenal for the 21st Century

Posted by Matt Asay  on  Jun 12, 2014 2

In this solutions track talk, sponsored by MongoDB, Matt Asay discusses the differences between some of the NoSQL and SQL databases and when Hadoop makes sense to be used with a NoSQL solution.

45:22

Next Gen Hadoop

Posted by Akmal B. Chaudhri  on  Apr 22, 2014

Akmal B. Chaudhri introduces Apache™ Hadoop® 2.0 and Yet Another Resource Negotiator (YARN).

55:02

What Can Hadoop Do for You?

Posted by Eva Andreasson  on  Apr 22, 2014

Eva Andreasson presents typical categories of problems that are commonly solved using Hadoop and also some concrete examples in each category.

43:05

Design Patterns for Large-Scale Real-Time Learning

Posted by Sean Owen  on  Apr 15, 2014

Sean Owen provides examples of operational analytics projects, presenting a reference architecture and algorithm design choices for a successful implementation based on his experience Oryx/Cloudera.

44:45

From The Lab To The Factory: Building A Production Machine Learning Infrastructure

Posted by Josh Wills  on  Jan 16, 2014

Josh Wills discusses using Hadoop technologies to build real-time data analysis models with a focus on strategies for data integration, large-scale machine learning, and experimentation.

46:49

Data & Infrastructure at Airbnb

Posted by Brenden Matthews  on  Dec 31, 2013

Brenden Matthews describes the infrastructure built at Airbnb using Mesos in order to support Hadoop and Storm.

36:51

Graph Computing at Scale

Posted by Matthias Broecheler  on  Dec 27, 2013

Matthias Broecheler discusses graph computing, introducing the Aurelius graph cluster enabling graph computing at scale by building on distributed systems like Cassandra, HBase, and Hadoop.

38:11

REEF: Retainable Evaluator Execution Framework

Posted by Rusty Sears  on  Dec 10, 2013

Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.

38:16

Apache Tez: Accelerating Hadoop Query Processing

Posted by Bikas Saha, Arun Murthy  on  Dec 05, 2013

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

52:24

Big Data Platform as a Service at Netflix

Posted by Jeff Magnusson  on  Nov 18, 2013

Jeff Magnusson details some of Netflix' key services: Franklin, Sting and Lipstick.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT