InfoQ Homepage Big Data Content on InfoQ
-
APIs in Government
Gray Brooks discusses the efforts around creating APIs for accessing the vast amounts of data administered by the US Government.
-
Expert Panel - The Relationship Between Big Data and the Semantic Web
Paul Buhler, Steve Hamby, Johan Kumps, Art Ligthart, Markus Zirn, and Clemens Utschig discuss the relationships between Big Data and established Semantic Web technologies.
-
Introduction to Spring Data
Mark Pollack provides a guided tour plus demos of the Spring Data feature set.
-
Making Hadoop Real Time with Scala & GridGain
Nikita Ivanov shows adding real-time capabilities to Hadoop through a demo application streaming word counting on a 2-nodes cluster.
-
Apache Cassandra Anti Patterns
Matthew Dennis covers the most common mistakes made with Cassandra that he has noticed being made both in deployment and code.
-
How to Build Big Data Pipelines for Hadoop Using OSS
Costin Leau discusses Big Data, current available tools for dealing with it, and how Spring can be used to create Big Data pipelines.
-
F# Big Data Scripting
Matthew Moloney shares some of the F# tools built at Microsoft Research for dealing with Big Data.
-
The Evolving Panorama of Data
Rebecca Parsons proposes taking a different look at data, using different approaches and tools, then looks at some of the ways social data is used these days.
-
Scaling Scalability: Evolving Twitter Analytics
Dmitriy Ryaboy shares some of the lessons learned scaling Twitter’s analytics infrastructure: Data loves a schema, Make data sources discoverable, and Make costs visible.
-
Lean Data Architecture: Minimize Investment, Maximize Value
Manvir Singh Grewal and Brandon Byars propose a business intelligence workflow along with Lean principles and practices for implementing a data warehouse and reporting capability.
-
Storm: Distributed and Fault-Tolerant Real-time Computation
Nathan Marz introduces Twitter Storm, outlining its architecture and use cases, and takes a look at future features to be made available.
-
Extending the Enterprise Data Warehouse with Hadoop
Rob Lancaster explains the steps made by Orbitz in order to bridge the gap between their data in the data warehouse and the data in Hadoop.