InfoQ Homepage Distributed Systems Content on InfoQ

Presentations

RSS Feed

Newer Older

NoSQL at Twitter

Kevin Weil presents how Twitter does data analysis using Scribe for logging, base analysis with Pig/Hadoop, and specialized data analysis with HBase, Cassandra, and FlockDB.

Kevin Weil
on Dec 23, 2010

Icon

55:43
Large Scale Map-Reduce Data Processing at Quantcast

Ron Bodkin presents the architecture used by Quantcast to process 100s of TB of data daily using Hadoop on dedicated systems, the applications, the type of data processed, and the infrastructure used.

Ron Bodkin
on Dec 21, 2010

Icon

58:49
Development at the Speed and Scale of Google

Ashish Kumar on how Google keeps the source code of over 2000 projects in a single code trunk containing 100s of M of code lines, with more than 5,000 developers accessing the same repository.

Ashish Kumar
on Dec 13, 2010

Icon

55:17
Availability, the Cloud and Everything

Joe Williams discusses how distributed systems, cloud computing and configuration management affect system’s availability. He exemplifies with a database service built on CouchDB, Erlang, Chef, EC2.

Joe Williams
on Dec 09, 2010

Icon

41:02
Global Software Delivery with Distributed Agile

Matthew Simons and Steven Boswell consider that distributed software development is a strategic capability for a company, presenting a framework and Agile practices for building such an environment.

Matthew Simons Steven Boswell
on Oct 21, 2010

Icon

01:02:58
Test-Driven Development of Asynchronous Systems

Nat Pryce exemplifies how he dealt with flickering, false positives, slow, and messy tests appearing in asynchronous testing when trying to perform end-to-end testing.

Nat Pryce
on Sep 17, 2010

Icon

55:34
Social Networks: Getting Distributed Web Services Done with NoSQL

Lars George and Fabrizio Schmidt present Germany’s largest social networks, Schuelervz, Studivz and Meinvz, the initial architecture, why it didn’t work and how they solved it with a NoSQL solution.

Lars George Fabrizio Schmidt
on Jun 29, 2010

Icon

52:18
Embracing Concurrency At Scale

Justin Sheehy explains the principles behind concurrent distributed systems: no global state, no ACID but rather BASE, no RPC but protocols over APIs, prepare for failure, degradation, measurement.

Justin Sheehy
on Jun 23, 2010

Icon

53:11
Horizontal Scalability via Transient, Shardable, and Share-Nothing Resources

Adam Wiggins details how memcached, CouchDB, Hadoop, Redis, Varnish, RabbitMQ, Erlang apply the transient, shardable and share-nothing principles to achieve horizontal scalability.

Adam Wiggins
on Apr 20, 2010

Icon

39:55
Facebook’s Petabyte Scale Data Warehouse using Hive and Hadoop

Ashish Thusoo and Namit Jain explain how Facebook manages to deal with analysis of 12 TB of compressed new data everyday with Hive’s help, an open source data warehousing framework built on Hadoop.

Ashish Thusoo Namit Jain
on Feb 21, 2010

Icon

58:26
RPC and its Offspring: Convenient, Yet Fundamentally Flawed

Steve Vinoski covers the history of RPC, standardization, distributed objects, CORBA, DCOM, Java, SOAP, WS-*, flaws in RPC, REST vs RPC philosophy, Erlang reliability and concurrency.

Steve Vinoski
on Dec 19, 2009

Icon

01:04:18
Hypertable - An Open Source, High Performance, Scalable Database

This presentation discusses Hypertable, an open source, high performance, distributed database modeled after Google's Bigtable. Doug offers a comprehensive discussion of all aspects of Hypertable.

Doug Judd
on Jul 31, 2009

Icon

53:00

Newer Presentations

Older Presentations

InfoQ Software Architects' Newsletter

Presentations