As more companies adopt Hadoop, its integration with other applications is becoming more important. A key to such integration is usage of the appropriate OutputFormat allowing to produce output data in a form most appropriate for other applications.
In this article authors show how leverage Oozie extensibility to implement custom language extensions. This approach can be viewed a specializing workflow language for a given company/line of business.
This article describes how interoperable clouds can be created, today, through the integration of open standards such as the Open Cloud Compute Interface, the Open Virtualisation Format and CDMI. They provide the means to package virtual infrastructure deployments, an API for the runtime management of storage infrastructure and an API for the runtime management of infrastructure as service.
Complete Oozie example, demonstrating language features and their usage in real world examples 2
A new marshaling framework - Apache Avro provides a lot of interesting new features. In his new article, Boris Lublinsky takes it for a test drive and provides some suggestions on its proper usage 4
Matrix presents a white paper on using the open source tool, Hadoop, to implement the MapReduce strategy and a Cloud computing strategy to solve business intelligence problems. 1
FlightCaster, a flight delay prediction site, uses Clojure and Hadoop for the statistical analysis, with a Rails/Heroku frontend. We talked to Bradford Cross about Clojure and functional programming. 1
In this special InfoQ interview, Hadoop project lead Doug Cutting discusses MapReduce, the benefits of open source, and the future direction of the project. 2