  • Interview with Alex Holmes, author of “Hadoop in Practice. Second Edition”

    by Boris Lublinsky on  Nov 20, 2014

    The new “Hadoop in Practice. Second Edition” book by Alex Holmes provides deep insight into the Hadoop ecosystem, covering a wide spectrum of topics: data organization, layouts and serialization; data processing, including MapReduce and big data patterns; special structures and their use in simplifying big data processing; and SQL on Hadoop data.

  • Matt Schumpert on Datameer Smart Execution

    by Srini Penchikala on  Nov 13, 2014

    Datameer, a big data analytics application for Hadoop, introduced Datameer 5.0 with Smart Execution, which dynamically selects the optimal compute framework at each step in the big data analytics process. InfoQ spoke with Matt Schumpert from the Datameer team about the new product and how it helps with big data analytics needs.

  • Stats Anomalies Detector

    by Yonatan Harel and Ran Levy on  Nov 07, 2014

    The article describes the general outline of the Stats Anomalies Detector we developed at MyHeritage and explains in detail how to enhance the code (which will be available soon on the MyHeritage GitHub) to meet your company’s needs.

Analytics Across the Enterprise: How IBM Realizes Business Value from Big Data and Analytics

Posted by Alex Giamas on  Oct 27, 2014

The book "Analytics Across the Enterprise" is a collection of experiences from analytics practitioners at IBM. InfoQ spoke with the authors about lessons learned and IBM technologies in the Big Data area.

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

Posted by Kai Wähner on  Sep 10, 2014

This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), and what technologies and products you can choose from.

Nikita Ivanov on GridGain’s In-Memory Accelerator for Hadoop

Posted by Srini Penchikala on  Sep 08, 2014

GridGain announced the In-Memory Accelerator for Hadoop, which offers the benefits of in-memory computing to Hadoop applications. InfoQ spoke with Nikita Ivanov from GridGain about the product's architecture.

Introducing Spring XD, a Runtime Environment for Big Data Applications

Posted by Charles Humble on  Jul 23, 2014

Spring XD (eXtreme Data) is Pivotal’s Big Data play. It joins Spring Boot and Grails as part of the execution portion of the Spring IO platform.

MLConf NYC 2014 Highlights

Posted by Charles Menguy on  Apr 17, 2014

MLConf took place in NYC on April 11th: a full day packed with talks on Machine Learning and Big Data, featuring speakers from many prominent companies.

Lambda Architecture: Design Simpler, Resilient, Maintainable and Scalable Big Data Solutions

Posted by Daniel Jebaraj on  Mar 12, 2014

Lambda Architecture proposes a simpler, elegant paradigm designed to process large amounts of data. In this article, the author discusses Lambda Architecture with the help of a sample Java application.

Embedded Analytics and Statistics for Big Data

Posted by Panos Louridas and Christof Ebert on  Feb 23, 2014

This article provides an overview of the tools and libraries available for embedded data analytics and statistics, covering both stand-alone software packages and programming languages with statistical capabilities.

Big Data Analytics for Security

Posted by Alvaro A. Cárdenas, Pratyusa K. Manadhata, Sreeranga P. Rajan on  Feb 11, 2014

In this article, the authors discuss the role of big data and Hadoop in the security analytics space and how to use MapReduce to process data for security analysis.

Building Applications With Hadoop

Posted by Roopesh Shenoy on  Jan 30, 2014

How to use various tools such as Apache Avro, Apache Crunch, Cloudera ML and the Cloudera Development Kit to build applications that use Hadoop.

InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.