Martin Kleppmann on Using Logs for Building Data Infrastructure, CAP, CRDTs
Jun 28, 2015
Martin Kleppmann explains how logs are used to implement systems (DBs, replication, consensus systems, etc), integrating DBs and log-based systems, the relevance of CAP and CRDTs, and much more.
How Twitter Answers Handles Five Billion Sessions a Day by Sergio De Simone Posted on Mar 09, 2015
Gobblin, LinkedIn's Unified Data Ingestion Platform by Mikio Braun Posted on Dec 15, 2014
Microsoft Expands Azure Machine Learning and Real Time Analytics Offering by Alex Giamas Posted on Oct 31, 2014
Apache Kafka - A Different Kind of Messaging System by Bienvenido David Posted on Dec 16, 2013
Samza in LinkedIn: How LinkedIn Processes Billions of Events Everyday in Real-time
Dec 05, 2014
Neha Narkhede of Kafka fame shares the experience of building LinkedIn's powerful and efficient data pipeline infrastructure around Apache Kafka and Samza to process billions of events every day.
The Game of Big Data: Scalable, Reliable Analytics Infrastructure at KIXEYE
Jul 19, 2014
Randy Shoup describes KIXEYE's analytics infrastructure from Kafka queues through Hadoop 2 to Hive and Redshift, built for flexibility, experimentation, iteration, componentization, testability, reliability and replay-ability.
Apache Kafka: Next Generation Distributed Messaging System
Jun 04, 2014
Apache Kafka is a distributed publish-subscribe messaging system. This article covers the architecture model, features and characteristics of Kafka framework and how it compares with traditional messaging systems.