InfoQ Homepage Big Data Content on InfoQ
-
Uncovering mysteries of InputFormat: Providing better control for your Map Reduce execution.
In their article authors, Boris Lublinsky and Mike Segel, show how to leverage custom InputFormat class implementation to tighter control execution strategy of Maps in Hadoop Map Reduce jobs.
-
Extending Oozie
In this article authors show how leverage Oozie extensibility to implement custom language extensions. This approach can be viewed a specializing workflow language for a given company/line of business.
-
Oozie by Example
End to end Oozie example, including process design, resource coordinator and workflow implementation
-
Data Mining in the Swamp: Taming Unruly Data With Cloud Computing
Matrix presents a white paper on using the open source tool, Hadoop, to implement the MapReduce strategy and a Cloud computing strategy to solve business intelligence problems.
-
SOA Agents: Grid Computing meets SOA
Grid technology for improving scalability, high availability and throughput in SOA implementations. In this article, Boris Lublinsky explains how Grid computing can be used in the overall SOA architecture and introduces a programming model for Grid utilization in service implementation. He also introduces an experimental Grid implementation that can support this proposed architecture.