Clusters Content on InfoQ
News about Clusters
How Heroku Manages High Availability - QCon London Talk summary
by
Fabian Lange
Posted on
Mar 23, 2012
Articles about Clusters
Uncovering mysteries of InputFormat: Providing better control for your Map Reduce execution.
by
Boris Lublinsky, Mike Segel
Posted on
Nov 04, 2011
In their article, Boris Lublinsky and Mike Segel show how to use a custom InputFormat class implementation to control more tightly the execution strategy of map tasks in Hadoop MapReduce jobs.
Interviews about Clusters
Optimizing for Big Data at Facebook
by
Ashish Thusoo
Posted on
Apr 17, 2012
Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in two areas: reducing the data footprint and reducing CPU utilization. Facebook generates 300 to 400 terabytes per day and stores the data in RC files as blocks, with the data arranged by column within each block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.
Presentations about Clusters
Scaling Pinterest
by
Yashwanth Nelapati, Marty Weiner
Posted on
Mar 26, 2013
Yashwanth Nelapati and Marty Weiner share lessons learned growing Pinterest: sharding MySQL, caching, server management, all on Amazon EC2.
Membase NoSQL: Clustered by Erlang
by
Sean Lynch and Matt Ingenthron
Posted on
Sep 12, 2011
Sean Lynch and Matt Ingenthron introduce Membase, detailing how they added clustering features in Erlang, what they built, and what lessons they learned along the way.



