BT
rss
38:11

REEF: Retainable Evaluator Execution Framework

Posted by Rusty Sears  on  Dec 10, 2013

Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.

38:16

Apache Tez: Accelerating Hadoop Query Processing

Posted by Bikas Saha, Arun Murthy  on  Dec 05, 2013

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

52:24

Big Data Platform as a Service at Netflix

Posted by Jeff Magnusson  on  Nov 18, 2013

Jeff Magnusson takes a deep dive into key services of Netflix’s “data platform as a service” architecture, including RESTful services that: provide comprehensive metadata management across data sources (Franklin); enable visualization and caching of results of Hadoop jobs (Sting); and visualize the execution plans produced by languages such as Pig and Hive (Lipstick).

Petabyte Scale Data at Facebook

Posted by Dhruba Borthakur  on  Dec 17, 2012 3

Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.

Hadoop and Cassandra, Sitting in a Tree ...

Posted by Jake Luciani  on  May 30, 2012

Jake Luciani introduces Brisk, a Hadoop and Hive distribution using Cassandra for core services and storage, presenting the benefits of running Hadoop in a peer-to-peer masterless architecture.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT