BT

Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ

Topics

Choose your language

InfoQ Homepage Presentations Distributed Data Analysis with Hadoop and R

Distributed Data Analysis with Hadoop and R

Bookmarks
47:03

Summary

Jonathan Seidman and Ramesh Venkataramaiah present how they run R on Hadoop in order to perform distributed analysis on large data sets, including some alternatives to their solution.

Bio

Jonathan Seidman is Lead Engineer on the Business Intelligence/Big Data team at Orbitz Worldwide and co-founder and organizer of the Chicago Hadoop User Group and founder of the Chicago Big Data User Group. Ramesh Venkataramaiah is a member of the Operations and Engineering Team at Orbitz Worldwide with a focus on analysis of distributed, high availability systems in the travel data domain.

About the conference

Strange Loop is a multi-disciplinary conference that aims to bring together the developers and thinkers building tomorrow's technology in fields such as emerging languages, alternative databases, concurrency, distributed systems, mobile development, and the web.

Recorded at:

Mar 09, 2012

BT