InfoQ Homepage Hadoop Content on InfoQ
-
Oozie by Example
End to end Oozie example, including process design, resource coordinator and workflow implementation
-
Introduction to Oozie
Basic introduction to Oozie - a framework allowing to combine multiple Map/Reduce jobs into a logical unit of work.
-
Using Apache Avro
Boris Lublinsky presents an introduction to AVRO and evaluate its usage for Schema componentization, inheritance and polymorphism. He also discusses backward compatibility issues and AVRO solutions for this problem.
-
Data Mining in the Swamp: Taming Unruly Data With Cloud Computing
Matrix presents a white paper on using the open source tool, Hadoop, to implement the MapReduce strategy and a Cloud computing strategy to solve business intelligence problems.
-
Clojure and Rails - the Secret Sauce Behind FlightCaster
FlightCaster, a realtime flight delay site, is built on Clojure and Hadoop for the statistical analysis. The web frontend is built with Ruby on Rails and hosted on Heroku. We talked to Bradford Cross about Clojure, functional programming and tips for OOP developers interested in making the jump.
-
Yahoo's Doug Cutting on MapReduce and the Future of Hadoop
InfoQ's lead Java editor, Scott Delap, recently caught up with Hadoop project lead Doug Cutting. Hadoop is an open source distributed computing platform that includes implementations of MapReduce and a distributed file system. In this special InfoQ interview Cutting discusses how Hadoop is used at Yahoo, the challenges of its development, and the future direction of the project.