Facebook’s Petabyte Scale Data Warehouse using Hive and Hadoop
by Ashish Thusoo and Namit Jain on Feb 21, 2010
58:26

Summary
Ashish Thusoo and Namit Jain explain how Facebook handles 12 TB of new compressed data every day with the help of Hive. Hive is an open source data warehousing framework built on Hadoop that lets developers analyze large datasets using SQL.

Bio

Ashish Thusoo currently manages the Facebook data infrastructure team. He is the project leader of Hive at Apache and a member of the Hadoop PMC. Namit Jain is a member of Facebook’s data infrastructure group and a committer for Hive. He also worked for over 10 years at Oracle on streaming technologies, XML, replication and queuing.

QCon is a conference organized by the community, for the community. The result is a high-quality conference experience in which a tremendous amount of attention and investment has gone into presenting the best content on the most important topics, delivered by the leaders in our community. QCon is designed with the technical depth and enterprise focus that interest technical team leads, architects, and project managers.
