InfoQ Homepage Hive Content on InfoQ
Interviews
RSS Feed-
Dean Wampler on Scalding, NoSQL, Scala, Functional Programming and Big Data
Dean Wampler explains Scalding and the other Hadoop support libraries, the return of SQL, how (big) data is the killer application for functional programming, Java 8 vs Scala, and much more.
-
Optimizing for Big Data at Facebook
Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in 2 areas: Reduction in the data footprint and CPU utilization. Generating 300 to 400 terabytes per day, they store RC files as blocks, but store as columns within a block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.