Hive Content on InfoQ
Interviews about Hive
Optimizing for Big Data at Facebook
by
Ashish Thusoo
Posted on
Apr 17, 2012
Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in 2 areas: Reduction in the data footprint and CPU utilization. Generating 300 to 400 terabytes per day, they store RC files as blocks, but store as columns within a block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.
Presentations about Hive
Petabyte Scale Data at Facebook
by
Dhruba Borthakur
Posted on
Dec 17, 2012
Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.
Hadoop and Cassandra, Sitting in a Tree ...
by
Jake Luciani
Posted on
May 30, 2012
Jake Luciani introduces Brisk, a Hadoop and Hive distribution using Cassandra for core services and storage, presenting the benefits of running Hadoop in a peer-to-peer masterless architecture.
News about Hive
Greenplum Pivotal HD Combines the Strengths of SQL and Hadoop by Abel Avram Posted on Feb 27, 2013
Competition between Real-time Hadoop Implementations Heats Up by Boris Lublinsky Posted on Feb 25, 2013
Yahoo Hadoop Spinout Hortonworks Announces Plans by Ron Bodkin Posted on Jun 29, 2011



