Dean Wampler on Scalding, NoSQL, Scala, Functional Programming and Big Data
Dec 16, 2013
Dean Wampler explains Scalding and the other Hadoop support libraries, the return of SQL, how (big) data is the killer application for functional programming, Java 8 vs Scala, and much more.
Optimizing for Big Data at Facebook
Apr 17, 2012
Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in 2 areas: Reduction in the data footprint and CPU utilization. Generating 300 to 400 terabytes per day, they store RC files as blocks, but store as columns within a block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.
Interactive SQL in Apache Hadoop with Impala and Hive by Alex Giamas Posted on Feb 07, 2014
Greenplum Pivotal HD Combines the Strengths of SQL and Hadoop by Abel Avram Posted on Feb 27, 2013
Competition between Real-time Hadoop Implementations Heats Up by Boris Lublinsky Posted on Feb 25, 2013 7