30:33

Building Applications using Apache Hadoop

Posted by Eli Collins  on  Aug 11, 2013

Eli Collins gives an overview of how to build new applications with Hadoop and how to integrate Hadoop with existing applications, and provides an update on the state of the Hadoop ecosystem, frameworks and APIs.

46:43

Copious Data, the "Killer App" for Functional Programming

Posted by Dean Wampler  on  Aug 03, 2013

Dean Wampler advocates using Functional Programming and its core operations to process large amounts of data, explaining why Java’s dominance in Hadoop is harming Big Data’s progress.

40:51

Cascalog: Logic Programming over Hadoop

Posted by Alex Robbins  on  Jun 28, 2013

Alex Robbins introduces Cascalog, a Clojure library for writing declarative Hadoop jobs.

44:36

Running the Largest Hadoop DFS Cluster

Posted by Hairong Kuang  on  Mar 15, 2013

Hairong Kuang explains how Facebook uses HDFS to store and analyze over 100PB of user log data.

37:29

Making Hadoop Real Time with Scala & GridGain

Posted by Nikita Ivanov  on  Mar 04, 2013

Nikita Ivanov shows how to add real-time capabilities to Hadoop through a demo application that performs streaming word counting on a two-node cluster.

24:56

Building an Impenetrable ZooKeeper

Posted by Kathleen Ting  on  Feb 13, 2013

Kathleen Ting details 8 misconfigurations that can bring ZooKeeper down.

01:11:51

How to Build Big Data Pipelines for Hadoop Using OSS

Posted by Costin Leau  on  Feb 08, 2013

Costin Leau discusses Big Data, the tools currently available for dealing with it, and how Spring can be used to create Big Data pipelines.

Storm: Distributed and Fault-Tolerant Real-time Computation

Posted by Nathan Marz  on  Jan 04, 2013

Nathan Marz introduces Twitter Storm, outlining its architecture and use cases, and looks at features planned for future releases.

Extending the Enterprise Data Warehouse with Hadoop

Posted by Rob Lancaster  on  Dec 27, 2012

Rob Lancaster explains the steps Orbitz took to bridge the gap between the data in their data warehouse and the data in Hadoop.

Introducing Apache Hadoop: The Modern Data Operating System

Posted by Eli Collins  on  Dec 18, 2012

Eli Collins introduces Hadoop: why it came about, the benefits it brings, its history, its architecture, and its use cases and applications.

Petabyte Scale Data at Facebook

Posted by Dhruba Borthakur  on  Dec 17, 2012

Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.

Big Time: Introducing Hadoop on Azure

Posted by Yaniv Rodenski  on  Nov 21, 2012

Yaniv Rodenski introduces Hadoop, then covers running Hadoop on Azure and the tools and frameworks available for it.

InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.