Datameer, a big data analytics application for Hadoop, introduced Datameer 5.0 with Smart Execution to dynamically select the optimal compute framework at each step in the big data analytics process. InfoQ spoke with Matt Schumpert from Datameer team about the new product and how it works to help with big data analytics needs.
The article describes the general outline of the Stats Anomalies Detector we developed at MyHeritage and provides a detailed explanation of how to enhance the code (will be available soon at MyHeritage GitHub) to meet your company’s needs.
Analytics Across the Enterprise: How IBM Realizes Business Value from Big Data and Analytics book by Brenda L. Dietrich, Emily C. Plachy, and Maureen F. Norton is a collection of experiences by analytics practitioners in IBM. InfoQ spoke with the authors about the lessons learned from the book, the arsenal of technologies IBM has about Big Data and the future of Analytics.
This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), and what technologies and products you can choose from. 8
GridGain announced In-Memory Accelerator for Hadoop, offering benefits of in-memory computing to Hadoop applications. InfoQ spoke with Nikita Ivanov from GridGain about the product's architecture.
Spring XD (eXtreme Data) is Pivotal’s Big Data play. It joins Spring Boot and Grails as part of the execution portion of the Spring IO platform. 1
The MLConf conference was going strong in NYC on April 11th and was a full day packed with talks around Machine Learning and Big Data, featuring speakers from many prominent companies.
Lambda Architecture proposes a simpler, elegant paradigm designed to process large amounts of data. In this article, author discusses Lambda Architecture with the help of a sample Java application. 20
This article provides an overview of tools and libraries available for embedded data analytics & statistics, both stand-alone software packages and programming languages with statistical capabilities.
In this article, authors discuss the role of big data and Hadoop in security analytics space and how to use MapReduce to process data for security analysis.
How to use various tools such as Apache Avro, Apache Crunch, Cloudera ML and the Cloudera Development Kit to build applications that use Hadoop.
Raffi Krikorian, Vice President of Platform Engineering at Twitter, gives an insight on how Twitter prepares for unexpected traffic peaks and how system architecture is designed to support failure. 1