BT
Older Newer rss
44:45

From The Lab To The Factory: Building A Production Machine Learning Infrastructure

Posted by Josh Wills  on  Jan 16, 2014

Josh Wills discusses using Hadoop technologies to build real-time data analysis models with a focus on strategies for data integration, large-scale machine learning, and experimentation.

46:49

Data & Infrastructure at Airbnb

Posted by Brenden Matthews  on  Dec 31, 2013

Brenden Matthews describes the infrastructure built at Airbnb using Mesos in order to support Hadoop and Storm.

36:51

Graph Computing at Scale

Posted by Matthias Broecheler  on  Dec 27, 2013

Matthias Broecheler discusses graph computing, introducing the Aurelius graph cluster enabling graph computing at scale by building on distributed systems like Cassandra, HBase, and Hadoop.

38:11

REEF: Retainable Evaluator Execution Framework

Posted by Rusty Sears  on  Dec 10, 2013

Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.

38:16

Apache Tez: Accelerating Hadoop Query Processing

Posted by Bikas Saha, Arun Murthy  on  Dec 05, 2013

Bikas Saha and Arun Murthy detail the design of Tez, highlighting some of its features and sharing some of the initial results obtained by Hive on Tez.

36:32

Raft - The Understandable Distributed Protocol

Posted by Ben Johnson  on  Dec 04, 2013 2

Ben Johnson discusses the Raft protocol and how it works. Raft is a consensus distributed protocol.

43:43

Grails SOA: Building Distributed Scalable Services with Grails and RabbitMQ

Posted by Steve Pember  on  Nov 27, 2013 1

Steve Pember discusses creating Grails applications integrating message broker technologies, especially RabbitMQ, and applying SOA principles.

39:22

Spanner - Google's Distributed Database

Posted by Sebastian Kanthak  on  Nov 25, 2013

Sebastian Kanthak details how Spanner relies on GPS and atomic clocks to provide two of its innovative features: Lock-free strong reads and global snapshots consistent with external events.

37:49

Add ALL the Things: Abstract Algebra Meets Analytics

Posted by Avi Bryant  on  Nov 20, 2013 4

Avi Bryant discusses how the laws of group theory provide a useful codification of the practical lessons of building efficient distributed and real-time aggregation systems.

43:00

Partitions for Everyone!

Posted by Kyle Kingsbury  on  Nov 19, 2013 1

Kyle Kingsbury discusses some of the limitations found in distributed systems and the way some of them behave under partitioning.

52:24

Big Data Platform as a Service at Netflix

Posted by Jeff Magnusson  on  Nov 18, 2013

Jeff Magnusson details some of Netflix' key services: Franklin, Sting and Lipstick.

59:10

Scaling out with Akka Actors

Posted by Joshua Suereth  on  Oct 31, 2013 4

Joshua Suereth designs a scalable distributed search service with Akka and Scala using actors, and covering practical aspects of how to scale out with Akka’s clustering API.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT