InfoQ Homepage Distributed Systems Content on InfoQ
-
Leveraging Your Hadoop Cluster Better - Running Performant Code at Scale
Michael Kopp explains how to run performance code at scale with Hadoop and how to analyze and optimize Hadoop jobs.
-
Lessons Learned Building Storm
Nathan Marz shares lessons learned building Storm, an open-source, distributed, real-time computation system.
-
Building Applications using Apache Hadoop
Eli Collins overviews how to build new applications with Hadoop and how to integrate Hadoop with existing applications, providing an update on the state of Hadoop ecosystem, frameworks and APIs.
-
Copious Data, the "Killer App" for Functional Programming
Dean Wampler supports using Functional Programming and its core operations to process large amounts of data, explaining why Java’s dominance in Hadoop is harming Big Data’s progress.
-
Cascalog: Logic Programming over Hadoop
Alex Robbins introduces Cascalog, a Clojure library for writing declarative Hadoop jobs.
-
Introduction to Actors Systems
Josh Suereth designs a distributed search service with Akka using Actors, covering: message passing, designing topologies, handling failure, service overload detection and tracking user sessions.
-
Cloud Foundry Architecture
Ramnivas Laddad sketches the architecture of Cloud Foundry, explaining how they manage to do hot swaps without application downtime, including lessons applicable in general distributed environments.
-
Architectural Patterns for High Availability
Adrian Cockcroft presents Netflix globally distributed architecture, the benchmarks used, scalability issues, and the open source components their implementation is based upon.
-
Designing Fault Tolerant Distributed Applications
Scott Andreas discussing creating fault tolerant distributed applications, and demoes Ordasity, a framework for building self-organizing systems with services.
-
Running the Largest Hadoop DFS Cluster
Hairong Kuang explains how Facebook uses HDFS to store and analyze over 100PB of user log data.
-
Software Architecture using ZeroMQ
Pieter Hintjens explains how to use contracts and rapid iterative design cycles to architect large-scale distributed systems with ZeroMQ.
-
Making Hadoop Real Time with Scala & GridGain
Nikita Ivanov shows adding real-time capabilities to Hadoop through a demo application streaming word counting on a 2-nodes cluster.