• Followers

    Interview and Video Review: Working with Big Data: Infrastructure, Algorithms, and Visualizations

    by Aslan Brooke Follow 0 Followers on  May 02, 2013

    Paul Dix leads a practical exploration into Big Data in this video training series. The first five lessons of the training span multiple server systems with a focus on the end to end processing of large quantities of XML data from real Stack Exchange posts. He completes the training with a lesson on developing visualizations for gaining insights from the macro level analysis of Big Data.

  • Followers

    Hadoop Virtual Panel

    by Boris Lublinsky Follow 0 Followers on  Nov 20, 2012

    In this virtual panel, InfoQ talks to several Hadoop vendors and users about their views at current and future state of Hadoop and the things that are the most important for Hadoop’s further adoption and success.

  • Followers

    Interview with Arun Murthy on Apache YARN

    by Boris Lublinsky Follow 0 Followers on  Aug 17, 2012 1

    Apache Hadoop YARN – a new Hadoop resource manager - has just been promoted to a high level Hadoop subproject. InfoQ had the chance to discuss YARN with Arun Murthy - founder and architect at Hortonworks.


Generating Avro Schemas from XML Schemas Using JAXB

Posted by Benjamin Fagin Follow 0 Followers on  Mar 06, 2012

In his new article Benjamin Fagin discuss how to write XJC plugins and use this technique to generate AVRO schemes and marshaling classes directly from existing XSD files 7


Exploring Hadoop OutputFormat

Posted by Jim.Blomo Follow 0 Followers on  Dec 07, 2011

Usage of custom Hadoop OutputFormat allows to produce output data in a form most appropriate for other applications. 4


Extending Oozie

Posted by Boris Lublinsky Follow 0 Followers , Mike Segel Follow 0 Followers on  Aug 02, 2011

In this article authors show how to extend Oozie by introducing custom actions, specific for a given company/line of business. 10


An Open, Interoperable Cloud

Posted by Andy Edmonds Follow 0 Followers , Thijs Metsch Follow 0 Followers , Eugene Luster Follow 0 Followers on  Jul 19, 2011

This article describes how interoperable clouds can be created, today, through the integration of open standards such as the Open Cloud Compute Interface, the Open Virtualisation Format and CDMI. 3


Oozie by Example

Posted by Boris Lublinsky Follow 0 Followers , Mike Segel Follow 0 Followers on  Jul 18, 2011

Complete Oozie example, demonstrating language features and their usage in real world examples 2


Introduction to Oozie

Posted by Boris Lublinsky Follow 0 Followers , Michael Segel Follow 0 Followers on  Jul 12, 2011

Introduction to Oozie and some of the ways it can be used. 7


Using Apache Avro

Posted by Boris Lublinsky Follow 0 Followers on  Jan 25, 2011

A new marshaling framework - Apache Avro provides a lot of interesting new features. In his new article, Boris Lublinsky takes it for a test drive and provides some suggestions on its proper usage 5


Data Mining in the Swamp: Taming Unruly Data With Cloud Computing

Posted by John Brothers Follow 0 Followers on  Aug 13, 2010

Matrix presents a white paper on using the open source tool, Hadoop, to implement the MapReduce strategy and a Cloud computing strategy to solve business intelligence problems. 1


Clojure and Rails - the Secret Sauce Behind FlightCaster

Posted by Werner Schuster Follow 4 Followers on  Aug 20, 2009

FlightCaster, a flight delay prediction site, uses Clojure and Hadoop for the statistical analysis, with a Rails/Heroku frontend. We talked to Bradford Cross about Clojure and functional programming. 1

Login to InfoQ to interact with what matters most to you.

Recover your password...


Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.


More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.


Stay up-to-date

Set up your notifications and don't miss out on content that matters to you