BT
  • Mike Barlow on Real-Time Big Data Analytics

    by Srini Penchikala on  Jun 01, 2013 3

    "Real-Time Big Data Analytics: Emerging Architecture" white paper authored by Mike Barlow covers big data analytics topic and how real-time big data analytics (RTBDA) are different from traditional analytics. InfoQ spoke with Mike about the current state of real-time big data analytics and the emerging trends in the Big Data space like Decision Science.

  • Interview and Video Review: Working with Big Data: Infrastructure, Algorithms, and Visualizations

    by Aslan Brooke on  May 02, 2013

    Paul Dix leads a practical exploration into Big Data in this video training series. The first five lessons of the training span multiple server systems with a focus on the end to end processing of large quantities of XML data from real Stack Exchange posts. He completes the training with a lesson on developing visualizations for gaining insights from the macro level analysis of Big Data.

  • Apache Crunch: A Java Library for Easier MapReduce Programming

    by Josh Wills on  Dec 27, 2012

    In his new article Josh Wills introduces Crunch - a new Apache incubating project providing a Java library for creating MapReduce pipelines. Crunch is based on a set of high level abstractions simplifying MapReduce applications design and provides library of patterns to implement common tasks like data joins, aggregations, and sorting.

Unit Testing Hadoop MapReduce Jobs With MRUnit, Mockito, & PowerMock

Posted by Michael Spicuzza on  Dec 03, 2012

Hadoop MapReduce jobs have a unique code architecture that raises interesting issues for test-driven development. In this article Michael Spicuzza shows how to use MRUnit to solve these problems. 1

Interview and Book Review: NoSQL Distilled

Posted by Srini Penchikala on  Nov 29, 2012

InfoQ spoke with NoSQL Distilled book authors, Pramod Sadalage and Martin Fowler about NoSQL database space and the emerging trends in NoSQL.

The State of NoSQL

Posted by Stefan Edlich on  Nov 21, 2012

Stefan Edlich reviews NoSQL, considering its evolution, financial impact, standards or their lack of, current landscape, books, the leaders and some newcomers, concluding that NoSQL is here to stay. 3

Hadoop Virtual Panel

Posted by Boris Lublinsky on  Nov 20, 2012

In this virtual panel, InfoQ talks to several Hadoop vendors and users about their views at current and future state of Hadoop.

The Architecture of Datomic

Posted by Rich Hickey on  Nov 02, 2012

Rich Hickey, the author of Clojure, explains the architecture of Datomic - a new database designed as a composition of simple services, combining the capabilities of RDBMS and scalability of NoSQL. 2

Julien Nioche on Apache Nutch 2 Features and Product Roadmap

Posted by Srini Penchikala on  Nov 01, 2012

Open source web-search framework Apache Nutch version 2 supports link-graph database and HTML parsing. InfoQ spoke with Julien Nioche, VP of Apache Nutch project, about the new features.

Blueprint for a Big Data Solution

Posted by Jonathan Natkins on  Oct 31, 2012

In his new article Jonathan Natkins explains how to use components of Apache Hadoop, including Flume, Hive and Oozie to implement a typical Data management system. 2

Inside the Complexity of Delivering Cloud Computing

Posted by JP Morgenthal on  Oct 03, 2012

This article answers the question, is cloud computing really all that hard? 2

Hadoop and Metadata (Removing the Impedance Mis-match)

Posted by Alan Gates, Russell Jurney on  Sep 26, 2012

A new Apache HCatalog provides a metadata and table management system for Hadoop ecosystem, simplifying data interoperability between different data processing tools

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT