Craig Andera explains Datomic from the perspective gained in implementing and optimizing a real-world production system, detailing the Datomic indexing process.
Garrett Eardley explores how Riot Games is leveraging Riak for their stats system, discussing why they chose Riak, the data model and indexes, and strategies for working with eventually consistent data.
Rusty Sears introduces REEF along with examples of computational frameworks, including interactive sessions, iterative graph processing, bulk synchronous computations, Hive queries, and MapReduce.
Nick Kolegraff discusses common problems and architecture to support all the phases of data science and how to start a data science initiative, sharing lessons from Accenture, Best Buy, and Rackspace.
Sebastian Kanthak overviews Spanner, covering details of how Spanner relies on GPS and atomic clocks to provide two of its most innovative features: Lock-free strong (current) reads and global snapshots that are consistent with external events.
Paul King presents working with databases in Groovy, covering datasets, GMongo, Neo4J, raw JDBC, Groovy-SQL, CRUD, Hibernate, caching, Spring Data technologies, etc.
Paco Nathan reviews an example data analysis application written in Cascalog used for a recommender system based on City of Palo Alto Open Data.
Avi Bryant discusses how the laws of group theory provide a useful codification of the practical lessons of building efficient distributed and real-time aggregation systems.
Jeff Magnusson takes a deep dive into key services of Netflix’s “data platform as a service” architecture, including RESTful services that: provide comprehensive metadata management across data sources (Franklin); enable visualization and caching of results of Hadoop jobs (Sting); and visualize the execution plans produced by languages such as Pig and Hive (Lipstick).
Tamar Bercovici presents Box’s transition from a single MySQL database to a fully sharded MySQL architecture, all the while serving 2 billion queries per day.
Crista Lopes writes a program in multiple styles -monolithic/OOP/continuations/relational/Pub-Sub/Monads/AOP/Map-reduce- showing the value of using more than a style in large scale systems.
Dan Frank discusses stream data processing and introduces NSQ – Bitly’s open source queuing system – and other new technologies used for communication between streaming programs.
CONTENT IN THIS BOX PROVIDED BY OUR SPONSOR
- 10 Things Developers Should Know about Couchbase
- When one is better than two: Collapsing data management layers for scalability and simplicity
- Couchbase NoSQL @ Tunewiki : A billion documents and counting
- The Essential Couchbase APIs Cheat Sheet
- Why MySQL 5.6 is no real threat to NoSQL
- How to Move from MySQL to Couchbase Server 2.0: Part 1
- Making Sense of NoSQL
- Couchbase in Action – Real world app demo
- Making the Shift from Relational to NoSQL