Data Content on InfoQ
-
Finding the Needle in a Big Data Haystack
In this solutions track talk, sponsored by Cloudera, Eva Andreasson discusses how search and Hadoop can help with some of the industry's biggest challenges. She introduces the data hub concept.
-
Data Movement at Very Large Scale
In this solutions track talk, sponsored by Solace Systems, Aaron Lee discusses the challenges of moving information and techniques that can increase the efficiency of data flows within big data architectures.
-
New Opportunities for Connected Data
In this solutions track talk, sponsored by Neo Technology, Ian Robinson takes a look at how size, structure and connectivity have converged to transform the data landscape.
-
A Call for Sanity in NoSQL
Nathan Marz discusses building NoSQL-based data systems that are scalable and easy to reason about.
-
Real Data Science at NASA
Chris Mattmann presents a vision of data science that integrates science software into rapid data production systems using cloud computing and open source software.
-
Ember-Data, the Way Forward
Igor Terzic presents several cases where Ember Data is used in production, and outlines some of the features that are intended to be included in the future.
-
Ember.js Advanced Patterns
Paul Chavard discusses advanced techniques for building large EmberJS applications with Ember Data.
-
Deploying Machine Learning and Data Science at Scale
Nick Kolegraff discusses common problems, an architecture to support all the phases of data science, and how to start a data science initiative, sharing lessons from Accenture, Best Buy, and Rackspace.
-
Spanner - Google's Distributed Database
Sebastian Kanthak details how Spanner relies on GPS and atomic clocks to provide two of its innovative features: lock-free strong reads and global snapshots consistent with external events.
-
Stream Processing: Philosophy, Concepts, and Technologies
Dan Frank discusses stream data processing and introduces NSQ – Bitly’s open source queuing system – and other new technologies used for communication between streaming programs.
-
Clojure: Enemy of the State
Alex Miller discusses Clojure’s approach to data, comparing it with OOP’s approach, and covering various related topics such as mutation, state vs. value, primitive and composite data.
-
Approximate Methods for Scalable Data Mining
Andrew Clegg gives an overview of methods, with use cases, for performing data set operations such as membership testing, distinct counts, and nearest-neighbour finding more efficiently.