Hive Content on InfoQ
Interviews about Hive
Optimizing for Big Data at Facebook
Apr 17, 2012
Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in two areas: reducing the data footprint and CPU utilization. With 300 to 400 terabytes generated per day, Facebook stores data in RC files, which are laid out as blocks but store the data as columns within each block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.
News about Hive
Greenplum Pivotal HD Combines the Strengths of SQL and Hadoop by Abel Avram Posted on Feb 27, 2013
Competition between Real-time Hadoop Implementations Heats Up by Boris Lublinsky Posted on Feb 25, 2013
Presentations about Hive
Big Data Platform as a Service at Netflix
Nov 18, 2013
Jeff Magnusson takes a deep dive into key services of Netflix’s “data platform as a service” architecture, including RESTful services that: provide comprehensive metadata management across data sources (Franklin); enable visualization and caching of results of Hadoop jobs (Sting); and visualize the execution plans produced by languages such as Pig and Hive (Lipstick).
Petabyte Scale Data at Facebook
Dec 17, 2012
Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.