BT
rss
38:01

Tracking Millions of Ganks in Near Real Time

Posted by Garrett Eardley  on  Dec 12, 2013

Garrett Eardley explores how Riot Games is leveraging Riak for their stats system, discussing why they chose Riak, the data model and indexes, and strategies for working with eventually consistent data.

49:34

Online Controlled Experiments: Introduction, Insights, Scaling, and Humbling Statistics

Posted by Ronny Kohavi  on  Dec 12, 2013 1

Ronny Kohavi shares lessons learned, cultural and scaling challenges conducting hundreds of concurrent online controlled experiments at Bing.

Distributed Data Analysis with Hadoop and R

Posted by Jonathan Seidman and Ramesh Venkataramaiah  on  Mar 09, 2012 2

Jonathan Seidman and Ramesh Venkataramaiah present how they run R on Hadoop in order to perform distributed analysis on large data sets, including some alternatives to their solution.

Storm: Distributed and Fault-tolerant Real-time Computation

Posted by Nathan Marz  on  Oct 21, 2011 1

Nathan Marz explain Storm, a distributed fault-tolerant and real-time computational system currently used by Twitter to keep statistics on user clicks for every URL and domain.

Machine Learning: A Love Story

Posted by Hilary Mason  on  Nov 09, 2010 16

Hilary Mason presents the history of machine learning covering some of the most significant developments taking place over the last two decades, especially the fundamental math and algorithmic tools employed. She also exemplifies how machine learning is used by bit.ly to discover various statistical information about users.

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT