BT

New Early adopter or innovator? InfoQ has been working on some new features for you. Learn more

rss

Optimizing for Big Data at Facebook

Interview with Ashish Thusoo on  Apr 17, 2012

Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in 2 areas: Reduction in the data footprint and CPU utilization. Generating 300 to 400 terabytes per day, they store RC files as blocks, but store as columns within a block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.

BT