Distributed Data Analysis with Hadoop and R
Summary
Jonathan Seidman and Ramesh Venkataramaiah explain how they run R on Hadoop to perform distributed analysis on large data sets, and discuss some alternatives to their solution.
Bio
Jonathan Seidman is Lead Engineer on the Business Intelligence/Big Data team at Orbitz Worldwide, co-founder and organizer of the Chicago Hadoop User Group, and founder of the Chicago Big Data User Group. Ramesh Venkataramaiah is a member of the Operations and Engineering Team at Orbitz Worldwide, focusing on the analysis of distributed, high-availability systems in the travel data domain.
About the conference
Strange Loop is a multi-disciplinary conference that aims to bring together the developers and thinkers building tomorrow's technology in fields such as emerging languages, alternative databases, concurrency, distributed systems, mobile development, and the web.
Community comments
Strange Loop
by Alex Miller
If you're interested in other upcoming videos from Strange Loop, the full release schedule is here and all slides are here. If you want to be notified about Strange Loop announcements in the future, sign up for the mailing list.