S Aerni, S Ramanujam and J Vawdrey present approaches and open source tools for wrangling and modeling massive datasets, scaling Java applications for NLP on MPP through PL/Java and much more.
Joseph Blomstedt presents ongoing work to build a new set of high performance data structures for Erlang, including both single process data structures as well as various concurrent data structures.
Joe Stein introduces Mesos and managing data services on it, presenting use cases for replacing classic solutions (like cold storage) with new functionality based on these technology.
Big Design Upfront was considered so evil in the early days of Agile that it acquired its own acronym. It’s time we relearned that great products start with asking the right questions.
Peter Lawrey discusses data-driven reactive systems, profiling latency distribution in such an environment, finding rare bugs, implementing resilience and monitoring.
Simon Metson approaches the problem of evolving a data system; some patterns and anti-patterns both technical (polyglot systems, lambda architectures) and organisational (data silos, lava layers).
Dave McCrory talks about what is Data Gravity, how it affects performance and portability and why these effects are amplified when there are larger volumes of data.
Peter Bourgon provides a practical introduction to Conflict-free Replicated Data Types (CRDTs) and describes a production CRDT system built at SoundCloud to serve several product features.
Mark Madsen explains the history of databases and data processing over the past decades and looks where the industry will go.
The authors discuss some of the unique challenges they've faced delivering highly personalized search over semi-structured data at massive scale.
John Canfield discusses the changing payment ecosystem, innovations in mining and organizing unstructured data from many sources, and approaches to deciding for loss minimization and user experience.
Ian Robinson takes a look at how size, structure and connectedness have converged to change the way we work with data, showing some new opportunities with connected data illustrated with graph search.