BT
Older rss
44:45

From The Lab To The Factory: Building A Production Machine Learning Infrastructure

Posted by Josh Wills  on  Jan 16, 2014

Josh Wills discusses using Hadoop technologies to build real-time data analysis models with a focus on strategies for data integration, large-scale machine learning, and experimentation.

46:49

Data & Infrastructure at Airbnb

Posted by Brenden Matthews  on  Dec 31, 2013

Brenden Matthews describes the infrastructure built at Airbnb using Mesos in order to support Hadoop and Storm.

36:51

Graph Computing at Scale

Posted by Matthias Broecheler  on  Dec 27, 2013

Matthias Broecheler discusses graph computing, introducing the Aurelius graph cluster enabling graph computing at scale by building on distributed systems like Cassandra, HBase, and Hadoop.

38:11

REEF: Retainable Evaluator Execution Framework

Posted by Rusty Sears  on  Dec 10, 2013

Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.

38:16

Apache Tez: Accelerating Hadoop Query Processing

Posted by Bikas Saha, Arun Murthy  on  Dec 05, 2013

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

52:24

Big Data Platform as a Service at Netflix

Posted by Jeff Magnusson  on  Nov 18, 2013

Jeff Magnusson takes a deep dive into key services of Netflix’s “data platform as a service” architecture, including RESTful services that: provide comprehensive metadata management across data sources (Franklin); enable visualization and caching of results of Hadoop jobs (Sting); and visualize the execution plans produced by languages such as Pig and Hive (Lipstick).

53:38

High Speed Smart Data Ingest into Hadoop

Posted by Oleg Zhurakousky  on  Oct 24, 2013

Oleg Zhurakousky discusses architectural tradeoffs and alternative implementations of real-time high speed data ingest into Hadoop.

28:12

A Guide to Python Frameworks for Hadoop

Posted by Uri Laserson  on  Oct 03, 2013

Uri Laserson reviews the different available Python frameworks for Hadoop, including a comparison of performance, ease of use/installation, differences in implementation, and other features.

35:50

Leveraging Your Hadoop Cluster Better - Running Performant Code at Scale

Posted by Michael Kopp  on  Aug 16, 2013

Michael Kopp explains how to run performance code at scale with Hadoop and how to analyze and optimize Hadoop jobs.

30:33

Building Applications using Apache Hadoop

Posted by Eli Collins  on  Aug 11, 2013

Eli Collins overviews how to build new applications with Hadoop and how to integrate Hadoop with existing applications, providing an update on the state of Hadoop ecosystem, frameworks and APIs.

46:43

Copious Data, the "Killer App" for Functional Programming

Posted by Dean Wampler  on  Aug 03, 2013 2

Dean Wampler supports using Functional Programming and its core operations to process large amounts of data, explaining why Java’s dominance in Hadoop is harming Big Data’s progress.

40:51

Cascalog: Logic Programming over Hadoop

Posted by Alex Robbins  on  Jun 28, 2013

Alex Robbins introduces Cascalog, a Clojure library for writing declarative Hadoop jobs.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT