F# Deep Dives, edited by Tomas Petricek and Phillip Trelford, is a new book aimed at showing what is the business value that using F# brings in practice. The book presents 11 real industrial scenarios and the way F# allowed field experts to solve them using a functional-first approach. InfoQ has interviewed Tomas Petricek, co-editor of the book.
The new book, The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems, looks at a wide range of considerations for cloud-scale systems. In this book review and interview with the authors, we look at how teams can apply proven best practices.
As data grows exponentially, the modern Hadoop ecosystem provides not only a reliable distributed aggregation system that delivers data parallelism, but also analytics for great data insights. In this article Monica Beckwith, starting from core Hadoop components, investigates the design of a highly available, fault tolerant Hadoop cluster, adding security and data-level isolation.
The new “Hadoop in Practice. 2 Edition" book by Alex Holmes covers a lot of topics building Hadoop code and organizing data to support code simplicity and execution speed.
Datameer, a big data analytics application for Hadoop, introduced Datameer 5.0 with Smart Execution to enhance the data analytics. InfoQ spoke with Matt Schumpert from Datameer about the new product.
This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), and what technologies and products you can choose from. 6
GridGain announced In-Memory Accelerator for Hadoop, offering benefits of in-memory computing to Hadoop applications. InfoQ spoke with Nikita Ivanov from GridGain about the product's architecture.
InfoQ spoke with Rich Reimer, VP of Marketing and Product Management at Splice Machine about the architecture and data patterns for SQL-on-Hadoop technologies.
Bikas Saha and Arun Murthy discuss Tez’s design, highlight some of its features and share some of the initial results obtained by making Hive use Tez instead of MapReduce.
The MLConf conference was going strong in NYC on April 11th and was a full day packed with talks around Machine Learning and Big Data, featuring speakers from many prominent companies.
Lambda Architecture proposes a simpler, elegant paradigm designed to process large amounts of data. In this article, author discusses Lambda Architecture with the help of a sample Java application. 7
In this article, authors discuss the role of big data and Hadoop in security analytics space and how to use MapReduce to process data for security analysis.