-
Followers
Book Review and Interview: The Practice of Cloud System Administration
The new book, The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems, looks at a wide range of considerations for cloud-scale systems. In this book review and interview with the authors, we look at how teams can apply proven best practices.
-
Java Java Follow 720 Followers
Designing a Highly Available, Fault Tolerant, Hadoop Cluster with Data Isolation
As data grows exponentially, the modern Hadoop ecosystem provides not only a reliable distributed aggregation system that delivers data parallelism, but also analytics for great data insights. In this article Monica Beckwith, starting from core Hadoop components, investigates the design of a highly available, fault tolerant Hadoop cluster, adding security and data-level isolation.
-
Followers
Interview with Alex Holmes, author of “Hadoop in Practice. Second Edition”
The new “Hadoop in Practice. Second Edition” book by Alex Holmes provides a deep insight into Hadoop ecosystem covering a wide spectrum of topics such as data organization, layouts and serialization, data processing, including MapReduce and big data patterns, special structures along with their usage to simplify big data processing, and SQL on Hadoop data.
Matt Schumpert on Datameer Smart Execution
Datameer, a big data analytics application for Hadoop, introduced Datameer 5.0 with Smart Execution to enhance the data analytics. InfoQ spoke with Matt Schumpert from Datameer about the new product.
Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse
Nikita Ivanov on GridGain’s In-Memory Accelerator for Hadoop
GridGain announced In-Memory Accelerator for Hadoop, offering benefits of in-memory computing to Hadoop applications. InfoQ spoke with Nikita Ivanov from GridGain about the product's architecture.
Rich Reimer on SQL-on-Hadoop Databases and Splice Machine
InfoQ spoke with Rich Reimer, VP of Marketing and Product Management at Splice Machine about the architecture and data patterns for SQL-on-Hadoop technologies.
Lambda Architecture: Design Simpler, Resilient, Maintainable and Scalable Big Data Solutions
Big Data Analytics for Security
In this article, authors discuss the role of big data and Hadoop in security analytics space and how to use MapReduce to process data for security analysis.
Building Applications With Hadoop
Building a Real-time, Personalized Recommendation System with Kiji
Jon Natkins explains in this article how to create a personalized recommendation system fed with large amounts of real-time data using Kiji, which leverages HBase, Avro, Map-Reduce and Scalding.
Costin Leau on Elasticsearch, BigData and Hadoop
Elasticsearch is an open source, distributed real-time search and analytics engine for the cloud. InfoQ spoke with Costin Leau about Elasticsearch and how it integrates with Hadoop and Big Data.