BT
Older Newer rss

Introducing Apache Hadoop: The Modern Data Operating System

Posted by Eli Collins  on  Dec 18, 2012 2

Eli Collins introduces Hadoop: why it came about, the benefits it produces, its history, its architecture, use cases and applications.

Petabyte Scale Data at Facebook

Posted by Dhruba Borthakur  on  Dec 17, 2012 3

Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.

Big Time: Introducing Hadoop on Azure

Posted by Yaniv Rodenski  on  Nov 21, 2012

Yaniv Rodenski introduces Hadoop, then running Hadoop on Azure and the available tools and frameworks.

MapReduce and Its Discontents

Posted by Dean Wampler  on  Oct 05, 2012 1

Dean Wampler discusses the strengths and weaknesses of MapReduce, and the newer variants for big data processing: Pregel and Storm.

Hadoop: Scalable Infrastructure for Big Data

Posted by Parand Tony Darugar  on  Sep 07, 2012 3

Parand Tony Darugar overviews Hadoop, its processing model, the associated ecosystem and tools, discussing some real-life uses of Hadoop for analyzing and processing large amounts of data.

Big Data Architectures at Facebook

Posted by Ashish Thusoo  on  Jul 18, 2012

Ashish Thusoo presents the data scalability issues at Facebook and the data architecture evolution from EDW to Hadoop to Puma.

NetApp Case Study

Posted by Kumar Palaniapan and Scott Fleming  on  Jun 01, 2012 1

Kumar Palaniapan and Scott Fleming present how NetApp deals with big data using Hadoop, HBase, Flume, and Solr, collecting and analyzing TBs of log data with Think Big Analytics.

Hadoop and Cassandra, Sitting in a Tree ...

Posted by Jake Luciani  on  May 30, 2012

Jake Luciani introduces Brisk, a Hadoop and Hive distribution using Cassandra for core services and storage, presenting the benefits of running Hadoop in a peer-to-peer masterless architecture.

Grid Gain vs. Hadoop. Why Elephants Can't Fly

Posted by Dmitriy Setrakyan  on  May 16, 2012 3

Dmitriy Setrakyan introduces GridGain, comparing it and outlining the cases where it is a better fit than Hadoop, accompanied by a live demo showing how to set up a GridGain job.

Distributed Data Analysis with Hadoop and R

Posted by Jonathan Seidman and Ramesh Venkataramaiah  on  Mar 09, 2012 2

Jonathan Seidman and Ramesh Venkataramaiah present how they run R on Hadoop in order to perform distributed analysis on large data sets, including some alternatives to their solution.

Panel: Hadoop for the Enterprise Architect

Posted by Peter Sirota, Amr Awadallah, Eric Baldeschwieler, Ted Dunning, Guy Bayes, and Ron Bodkin  on  Feb 16, 2012 5

Peter Sirota, Amr Awadallah, Eric Baldeschwieler, Ted Dunning, Guy Bayes, and moderator Ron Bodkin discuss various existing Hadoop use cases, ecosystems, and disaster recovery.

NoSQL at Twitter

Posted by Ryan King  on  Jan 21, 2011

Ryan King presents how Twitter uses NoSQL technologies - Gizzard, Cassandra, Hadoop, Redis - to deal with increasing data amounts forcing them to scale out beyond what the traditional SQL has to offer.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT