Clusters Content on InfoQ

Articles about Clusters

Uncovering mysteries of InputFormat: Providing better control for your Map Reduce execution. by Boris Lublinsky, Mike Segel Posted on Nov 04, 2011 In their article authors, Boris Lublinsky and Mike Segel, show how to leverage custom InputFormat class implementation to tighter control execution strategy of Maps in Hadoop Map Reduce jobs.

Interviews about Clusters

Optimizing for Big Data at Facebook by Ashish Thusoo Posted on Apr 17, 2012 Hive co-creator Ashish Thusoo describes the Big Data challenges Facebook faced and presents solutions in 2 areas: Reduction in the data footprint and CPU utilization. Generating 300 to 400 terabytes per day, they store RC files as blocks, but store as columns within a block to get better compression. He also talks about the current Big Data ecosystem and trends for companies going forward.

Presentations about Clusters

Scaling Pinterest by Yashwanth Nelapati, Marty Weiner Posted on Mar 26, 2013 Yashwanth Nelapati and Marty Weiner share lessons learned growing Pinterest: sharding MySQL, caching, server management, all on Amazon EC2.

Membase NoSQL: Clustered by Erlang by Sean Lynch and Matt Ingenthron Posted on Sep 12, 2011 Sean Lynch and Matt Ingenthron introduce Membase, detailing how they added clustering features in Erlang, what they built and what lessons they leaned along the way.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy