Older rss

Oozie Plugin for Eclipse

Posted by Ahmed Mahran on  Oct 30, 2015

A new Eclipse Oozie plugin allows to significantly simplify implementation of Oozie processes by allowing to define them graphically. An article introduces plugin and provides an example of its usage. 1

Elixir in Action Review and Q&A with the Author

Posted by Sergio De Simone on  Aug 08, 2015

Elixir in action aims to introduce readers to Elixir and the Erlang virtual machine while also discussing concurrent programming topics, fault-tolerance, and topics related to high-availability.

Big Data as a Service, an Interview with Google's William Vambenepe

Posted by Chris Swan on  Jul 06, 2015

An interview with Google's William Vambenepe, who's lead product manager for Big Data services, to ask him about the shift from products to services when working with Big Data.

F# Deep Dives Review and Author Q&A

Posted by Sergio De Simone on  Feb 18, 2015

F# Deep Dives is a new book aimed at showing the business value that using F# brings in practice. It presents 11 industrial scenarios and their solution with F# using a functional-first approach.

Book Review and Interview: The Practice of Cloud System Administration

Posted by Richard Seroter on  Dec 18, 2014

The new book, The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems, looks at a wide range of considerations for cloud-scale systems.

Designing a Highly Available, Fault Tolerant, Hadoop Cluster with Data Isolation

Posted by Monica Beckwith on  Dec 16, 2014

In this article Monica Beckwith, starting from core Hadoop components, investigates the design of a highly available, fault tolerant Hadoop cluster, adding security and data-level isolation.

Interview with Alex Holmes, author of “Hadoop in Practice. Second Edition”

Posted by Boris Lublinsky on  Nov 20, 2014

The new “Hadoop in Practice. 2 Edition" book by Alex Holmes covers a lot of topics building Hadoop code and organizing data to support code simplicity and execution speed.

Matt Schumpert on Datameer Smart Execution

Posted by Srini Penchikala on  Nov 13, 2014

Datameer, a big data analytics application for Hadoop, introduced Datameer 5.0 with Smart Execution to enhance the data analytics. InfoQ spoke with Matt Schumpert from Datameer about the new product.

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

Posted by Kai Wähner on  Sep 10, 2014

This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), and what technologies and products you can choose from. 8