BT
Older Newer rss
36:45

Apache Drill - Interactive Query and Analysis at Scale

Posted by Michael Hausenblas  on  Oct 13, 2013

Michael Hausenblas introduces Apache Drill, a distributed system for interactive analysis of large-scale datasets, including its architecture and typical use cases.

28:12

A Guide to Python Frameworks for Hadoop

Posted by Uri Laserson  on  Oct 03, 2013

Uri Laserson reviews the different available Python frameworks for Hadoop, including a comparison of performance, ease of use/installation, differences in implementation, and other features.

37:10

Evolving Panorama of Data

Posted by Rebecca Parsons  on  Oct 02, 2013

Rebecca Parsons reviews some of the changes in how data is used and analyzed, including new technology approaches, looking at how data is used to track election violence, movement of people after a natural disaster, and attempts to predict famine and other humanitarian crises before they happen.

43:33

Leveraging Scriptable Infrastructures, Towards a Paradigm Shift in Software for Data Science

Posted by Karim Chine  on  Oct 02, 2013

Karim Chine introduces Elastic-R, demonstrating some of its applications in bioinformatics and finance.

51:42

Data Science of Love

Posted by Vaclav Petricek  on  Aug 17, 2013

Vaclav Petricek digs some of the romantic interactions nuggets hidden in eHarmony's large collection of human relationships.

35:50

Leveraging Your Hadoop Cluster Better - Running Performant Code at Scale

Posted by Michael Kopp  on  Aug 16, 2013

Michael Kopp explains how to run performance code at scale with Hadoop and how to analyze and optimize Hadoop jobs.

44:03

Lessons Learned Building Storm

Posted by Nathan Marz  on  Aug 11, 2013 2

Nathan Marz shares lessons learned building Storm, an open-source, distributed, real-time computation system.

30:33

Building Applications using Apache Hadoop

Posted by Eli Collins  on  Aug 11, 2013

Eli Collins overviews how to build new applications with Hadoop and how to integrate Hadoop with existing applications, providing an update on the state of Hadoop ecosystem, frameworks and APIs.

46:43

Copious Data, the "Killer App" for Functional Programming

Posted by Dean Wampler  on  Aug 03, 2013 2

Dean Wampler supports using Functional Programming and its core operations to process large amounts of data, explaining why Java’s dominance in Hadoop is harming Big Data’s progress.

40:07

Cloud and Big Data: Unicorns All the Way Down

Posted by Francine Bennett  on  Jul 20, 2013

Francine Bennett keynotes on using big data in the cloud.

59:33

The Big Data Revolution

Posted by Claudia Perlich  on  Jun 16, 2013

Claudia Perlich keynotes on M6D’s approach to Big Data, using data granularity to build predictive models used for user targeting, bid optimization and fraud detection.

39:32

The Why, What and How of Open Data

Posted by Jeni Tennison  on  Jun 11, 2013

Jeni Tennison explains how to evaluate an organization's data assets as potential sources of open data, and how to deal with the thorny issues of derived and personal data.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT