Distributed Data Analysis with Hadoop and R
Recorded at: Strange Loop

by Jonathan Seidman and Ramesh Venkataramaiah on Mar 09, 2012
47:03

Summary
Jonathan Seidman and Ramesh Venkataramaiah show how they run R on Hadoop to perform distributed analysis on large data sets, and discuss some alternatives to their solution.

Bio

Jonathan Seidman is Lead Engineer on the Business Intelligence/Big Data team at Orbitz Worldwide, co-founder and organizer of the Chicago Hadoop User Group, and founder of the Chicago Big Data User Group. Ramesh Venkataramaiah is a member of the Operations and Engineering Team at Orbitz Worldwide with a focus on analysis of distributed, high availability systems in the travel data domain.

Strange Loop is a multi-disciplinary conference that aims to bring together the developers and thinkers building tomorrow's technology in fields such as emerging languages, alternative databases, concurrency, distributed systems, mobile development, and the web.
