InfoQ Homepage Distributed Systems Content on InfoQ
-
Riak Core: Dynamo Building Blocks
Andy Gross discusses the design philosophy behind Riak based on Amazon Dynamo - Gossip Protocol, Consistent Hashing, Vector clocks, Read Repair, etc. -, overviewing its main features and architecture.
-
Being Elastic - Evolving Programming for the Cloud
Randy Shoup discusses the cloud programming model, covering topics such as state/statelessness, distribution, workload partitioning, cost and resource metering, automation, and deployment strategies.
-
NoSQL at Twitter
Ryan King presents how Twitter uses NoSQL technologies - Gizzard, Cassandra, Hadoop, Redis - to deal with increasing data amounts forcing them to scale out beyond what the traditional SQL has to offer
-
NoSQL at Twitter
Kevin Weil presents how Twitter does data analysis using Scribe for logging, base analysis with Pig/Hadoop, and specialized data analysis with HBase, Cassandra, and FlockDB.
-
Large Scale Map-Reduce Data Processing at Quantcast
Ron Bodkin presents the architecture used by Quantcast to process 100s of TB of data daily using Hadoop on dedicated systems, the applications, the type of data processed, and the infrastructure used.
-
Development at the Speed and Scale of Google
Ashish Kumar on how Google keeps the source code of over 2000 projects in a single code trunk containing 100s of M of code lines, with more than 5,000 developers accessing the same repository.
-
Availability, the Cloud and Everything
Joe Williams discusses how distributed systems, cloud computing and configuration management affect system’s availability. He exemplifies with a database service built on CouchDB, Erlang, Chef, EC2.
-
Global Software Delivery with Distributed Agile
Matthew Simons and Steven Boswell consider that distributed software development is a strategic capability for a company, presenting a framework and Agile practices for building such an environment.
-
Test-Driven Development of Asynchronous Systems
Nat Pryce exemplifies how he dealt with flickering, false positives, slow, and messy tests appearing in asynchronous testing when trying to perform end-to-end testing.
-
Social Networks: Getting Distributed Web Services Done with NoSQL
Lars George and Fabrizio Schmidt present Germany’s largest social networks, Schuelervz, Studivz and Meinvz, the initial architecture, why it didn’t work and how they solved it with a NoSQL solution.
-
Embracing Concurrency At Scale
Justin Sheehy explains the principles behind concurrent distributed systems: no global state, no ACID but rather BASE, no RPC but protocols over APIs, prepare for failure, degradation, measurement.
-
Horizontal Scalability via Transient, Shardable, and Share-Nothing Resources
Adam Wiggins details how memcached, CouchDB, Hadoop, Redis, Varnish, RabbitMQ, Erlang apply the transient, shardable and share-nothing principles to achieve horizontal scalability.