Clusters Content on InfoQ
News about Clusters
How Heroku Manages High Availability - QCon London Talk summary by Fabian Lange Posted on Mar 23, 2012
Articles about Clusters
Uncovering mysteries of InputFormat: Providing better control for your Map Reduce execution.
Boris Lublinsky, Mike Segel
Nov 04, 2011
In their article authors, Boris Lublinsky and Mike Segel, show how to leverage custom InputFormat class implementation to tighter control execution strategy of Maps in Hadoop Map Reduce jobs.
Interviews about Clusters
Optimizing for Big Data at Facebook
Apr 17, 2012
Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in 2 areas: Reduction in the data footprint and CPU utilization. Generating 300 to 400 terabytes per day, they store RC files as blocks, but store as columns within a block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.
Presentations about Clusters
Yashwanth Nelapati, Marty Weiner
Mar 26, 2013
Yashwanth Nelapati and Marty Weiner share lessons learned growing Pinterest: sharding MySQL, caching, server management, all on Amazon EC2.