InfoQ Homepage Distributed Systems Content on InfoQ

Presentations

RSS Feed

Newer Older

Big Time: Introducing Hadoop on Azure

Yaniv Rodenski introduces Hadoop, then running Hadoop on Azure and the available tools and frameworks.

Yaniv Rodenski
on Nov 21, 2012

Icon

55:21
Building Healthy Distributed Systems

Mark Phillips discusses 3 types of distributed systems and how they run them at Basho: Computer Systems, Communities, and Companies.

Mark Phillips
on Nov 07, 2012

Icon

44:59
Embracing Concurrency at Scale

Justin Sheehy discusses designing reliable distributed systems that can scale in order to deal with concurrency problems and the tradeoffs required by such systems.

Justin Sheehy
on Nov 02, 2012

Icon

43:44
MapReduce and Its Discontents

Dean Wampler discusses the strengths and weaknesses of MapReduce, and the newer variants for big data processing: Pregel and Storm.

Dean Wampler
on Oct 05, 2012

Icon

48:41
Hadoop: Scalable Infrastructure for Big Data

Parand Tony Darugar overviews Hadoop, its processing model, the associated ecosystem and tools, discussing some real-life uses of Hadoop for analyzing and processing large amounts of data.

Parand Tony Darugar
on Sep 07, 2012

Icon

58:37
Storm: Distributed and Fault-tolerant Real-time Computation

Nathan Marz discusses Storm concepts –streams, spouts, bolts, topologies-, explaining how to use Storms’ Clojure DSL for real-time stream processing, distributed RPS and continuous computations.

Nathan Marz
on Jul 25, 2012

Icon

42:26
Big Data Architectures at Facebook

Ashish Thusoo presents the data scalability issues at Facebook and the data architecture evolution from EDW to Hadoop to Puma.

Ashish Thusoo
on Jul 18, 2012

Icon

51:17
NetApp Case Study

Kumar Palaniapan and Scott Fleming present how NetApp deals with big data using Hadoop, HBase, Flume, and Solr, collecting and analyzing TBs of log data with Think Big Analytics.

Kumar Palaniapan Scott Fleming
on Jun 01, 2012

Icon

54:21
Hadoop and Cassandra, Sitting in a Tree ...

Jake Luciani introduces Brisk, a Hadoop and Hive distribution using Cassandra for core services and storage, presenting the benefits of running Hadoop in a peer-to-peer masterless architecture.

Jake Luciani
on May 30, 2012

Icon

45:41
Distributed Systems with ZeroMQ and gevent

Jeff Lindsay discusses creating distributed and concurrent systems using ZeroMQ – a lightweight message queue-, and gevent – a coroutine-based networking library.

Jeff Lindsay
on May 22, 2012

Icon

48:36
Grid Gain vs. Hadoop. Why Elephants Can't Fly

Dmitriy Setrakyan introduces GridGain, comparing it and outlining the cases where it is a better fit than Hadoop, accompanied by a live demo showing how to set up a GridGain job.

Dmitriy Setrakyan
on May 16, 2012

Icon

01:04:58
A P2P Digital Self with TeleHash

Jeremie Miller presents how to create a fully distributed data network in which nodes communicate directly with each other using UDP, JSON and Kademlia, without relying on central servers.

Jeremie Miller
on Apr 05, 2012

Icon

50:48

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations