BT
  • Building Applications With Hadoop

    by Roopesh Shenoy on  Jan 30, 2014

    When building applications using Hadoop, it is common to have input data from various sources coming in various formats. In his presentation, “New Tools for Building Applications on Apache Hadoop”, Eli Collins overviews how to build better products with Hadoop and various tools that can help, such as Apache Avro, Apache Crunch, Cloudera ML and the Cloudera Development Kit.

  • Building a Real-time, Personalized Recommendation System with Kiji

    by Jon Natkins on  Dec 26, 2013

    Jon Natkins explains in this article how to create a personalized recommendation system fed with large amounts of real-time data using Kiji, which leverages HBase, Avro, Map-Reduce and Scalding.

  • Costin Leau on Elasticsearch, BigData and Hadoop

    by Srini Penchikala on  Nov 15, 2013

    Elasticsearch is an open source, distributed real-time search and analytics engine for the cloud. The first milestone of elasticsearch-hadoop 1.3.M1 was released last month. InfoQ spoke with Costin Leau about Elasticsearch and how it integrates with Hadoop and other Big Data technologies.

Spoilt for Choice – How to choose the right Big Data / Hadoop Platform?

Posted by Kai Wähner on  Jul 09, 2013

Although Hadoop is a set of an open source Apache (and now GitHub) projects, there are currently a large number of alternatives for installing a version of Hadoop and realizing big data processes. 3

Interview and Video Review: Working with Big Data: Infrastructure, Algorithms, and Visualizations

Posted by Aslan Brooke on  May 02, 2013

Paul Dix leads a practical exploration into Big Data in this video training series. The training focuses on the high level architecture while teaching practical usage skills and Ruby algorithms.

Hadoop Virtual Panel

Posted by Boris Lublinsky on  Nov 20, 2012

In this virtual panel, InfoQ talks to several Hadoop vendors and users about their views at current and future state of Hadoop.

Interview with Arun Murthy on Apache YARN

Posted by Boris Lublinsky on  Aug 17, 2012

Apache Hadoop YARN – a new Hadoop resource manager - has just been promoted to a high level Hadoop subproject. InfoQ had the chance to discuss YARN with Arun Murthy - founder of Hortonworks. 1

Generating Avro Schemas from XML Schemas Using JAXB

Posted by Benjamin Fagin on  Mar 06, 2012

In his new article Benjamin Fagin discuss how to write XJC plugins and use this technique to generate AVRO schemes and marshaling classes directly from existing XSD files 9

Exploring Hadoop OutputFormat

Posted by Jim.Blomo on  Dec 07, 2011

Usage of custom Hadoop OutputFormat allows to produce output data in a form most appropriate for other applications. 2

Extending Oozie

Posted by Boris Lublinsky, Mike Segel on  Aug 02, 2011

In this article authors show how to extend Oozie by introducing custom actions, specific for a given company/line of business. 6

An Open, Interoperable Cloud

Posted by Andy Edmonds, Thijs Metsch, Eugene Luster on  Jul 19, 2011

This article describes how interoperable clouds can be created, today, through the integration of open standards such as the Open Cloud Compute Interface, the Open Virtualisation Format and CDMI. 3

Oozie by Example

Posted by Boris Lublinsky, Mike Segel on  Jul 18, 2011

Complete Oozie example, demonstrating language features and their usage in real world examples 2

General Feedback
Bugs
Advertising
Editorial
Marketing
InfoQ.com and all content copyright © 2006-2015 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT