InfoQ Homepage Big Data Content on InfoQ
-
Big Data Problems in Monitoring at eBay
Bhaven Avalani and Yuri Finklestein discuss 4 aspects encountered at eBay when dealing with monitoring data: reduction of data entropy, robust data distribution, metric extraction, efficient storage.
-
100% Big Data, 0% Hadoop, 0% Java
Pavlo Baron presents a big data case, a solution and the tools for collecting, mining and storing large amounts of data without using Hadoop or Java.
-
NoSQL: Past, Present, Future
Eric Brewer takes a look at NoSQL’s history and considers what should be done so the current NoSQL solutions to evolve in order to address the full range of the application needs.
-
Big Data, Small Computers
Cliff Click discusses RAIN, H2O, JMM, Parallel Computation, Fork/Joins in the context of performing big data analysis on tons of commodity hardware.
-
Introducing Apache Hadoop: The Modern Data Operating System
Eli Collins introduces Hadoop: why it came about, the benefits it produces, its history, its architecture, use cases and applications.
-
Petabyte Scale Data at Facebook
Dhruba Borthakur discusses the different types of data used by Facebook and how they are stored, including graph data, semi-OLTP data, immutable data for pictures, and Hadoop/Hive for analytics.
-
Facebook News Feed: Social Data at Scale
Serkan Piantino discusses news feeds at Facebook: the basics, infrastructure used, how feed data is stored, and Centrifuge – a storage solution.
-
Keynote: Spring 2012 and Beyond
Adrian Colyer, Juergen Hoeller, Mark Pollack and Graeme Rocher present SpringSource’s Unifying Component Model, current developments regarding Big Data, and betting on Grails.
-
Not Your Father’s Transaction Processing
Michael Stonebraker compares how RDBMS, NoSQL and NewSQL support today’s big data transaction processing needs. He also introduces VoltDB, an in-memory NewSQL database.
-
Runaway Complexity in Big Data, and a Plan to Stop It
Nathan Marz outlines several sources of complexity introduced in data systems - Lack of human fault-tolerance, Conflation of data and queries, Schemas done wrong - and what can be done to avoid them.
-
MongoDB - Born in the Cloud
Ross Lawley introduces MongoDB, explaining why it is a good solution for cloud deployment.
-
MapReduce and Its Discontents
Dean Wampler discusses the strengths and weaknesses of MapReduce, and the newer variants for big data processing: Pregel and Storm.