InfoQ Homepage Big Data Content on InfoQ
-
Leveraging Your Hadoop Cluster Better - Running Performant Code at Scale
Michael Kopp explains how to run performance code at scale with Hadoop and how to analyze and optimize Hadoop jobs.
-
Lessons Learned Building Storm
Nathan Marz shares lessons learned building Storm, an open-source, distributed, real-time computation system.
-
Building Applications using Apache Hadoop
Eli Collins overviews how to build new applications with Hadoop and how to integrate Hadoop with existing applications, providing an update on the state of Hadoop ecosystem, frameworks and APIs.
-
Copious Data, the "Killer App" for Functional Programming
Dean Wampler supports using Functional Programming and its core operations to process large amounts of data, explaining why Java’s dominance in Hadoop is harming Big Data’s progress.
-
Cloud and Big Data: Unicorns All the Way Down
Francine Bennett keynotes on using big data in the cloud.
-
The Big Data Revolution
Claudia Perlich keynotes on M6D’s approach to Big Data, using data granularity to build predictive models used for user targeting, bid optimization and fraud detection.
-
The Why, What and How of Open Data
Jeni Tennison explains how to evaluate an organization's data assets as potential sources of open data, and how to deal with the thorny issues of derived and personal data.
-
APIs in Government
Gray Brooks discusses the efforts around creating APIs for accessing the vast amounts of data administered by the US Government.
-
Expert Panel - The Relationship Between Big Data and the Semantic Web
Paul Buhler, Steve Hamby, Johan Kumps, Art Ligthart, Markus Zirn, and Clemens Utschig discuss the relationships between Big Data and established Semantic Web technologies.
-
Introduction to Spring Data
Mark Pollack provides a guided tour plus demos of the Spring Data feature set.
-
Making Hadoop Real Time with Scala & GridGain
Nikita Ivanov shows adding real-time capabilities to Hadoop through a demo application streaming word counting on a 2-nodes cluster.
-
Apache Cassandra Anti Patterns
Matthew Dennis covers the most common mistakes made with Cassandra that he has noticed being made both in deployment and code.