BT
Older rss
  • Big Data Processing with Apache Spark - Part 2: Spark SQL

    by Srini Penchikala on  Apr 16, 2015 1

    Spark SQL, part of Apache Spark big data framework, is used for structured data processing and allows running SQL like queries on Spark data. In this article, Srini Penchikala discusses Spark SQL module and how it simplifies running data analytics using SQL interface. He also talks about the new features in Spark SQL, like DataFrames and JDBC data sources.

  • Highly Distributed Computations Without Synchronization

    by Christopher Meiklejohn on  Feb 17, 2015 1

    Synchronization of data across systems is expensive and impractical when running systems at scale. Traditional approaches for performing computations or information dissemination are not viable. In this article Basho Sr. Software Engineer Chris Meiklejohn explores the basic building blocks for crafting deterministic applications that guarantee convergence of data without synchronization.

  • Big Data Processing with Apache Spark – Part 1: Introduction

    by Srini Penchikala on  Jan 30, 2015 2

    Apache Spark is an open source big data processing framework built around speed, ease of use, and sophisticated analytics. In this article, Srini Penchikala talks about how Apache Spark framework helps with big data processing and analytics with its standard API. He also discusses how Spark compares with traditional MapReduce implementation like Apache Hadoop.

Apache Ignite GridGain Incubator Project - Q&A Interview with Nikita Ivanov

Posted by Srini Penchikala on  Dec 03, 2014

GridGain announced that the In-Memory Data Fabric has been accepted into Apache Incubator program as Apache Ignite. InfoQ spoke with Nikita Ivanov about their product becoming part of Apache.

Interview with Alex Holmes, author of “Hadoop in Practice. Second Edition”

Posted by Boris Lublinsky on  Nov 20, 2014

The new “Hadoop in Practice. 2 Edition" book by Alex Holmes covers a lot of topics building Hadoop code and organizing data to support code simplicity and execution speed.

Matt Schumpert on Datameer Smart Execution

Posted by Srini Penchikala on  Nov 13, 2014

Datameer, a big data analytics application for Hadoop, introduced Datameer 5.0 with Smart Execution to enhance the data analytics. InfoQ spoke with Matt Schumpert from Datameer about the new product.

Stats Anomalies Detector

Posted by Yonatan Harel and Ran Levy on  Nov 07, 2014

The article describes the general outline of the Stats Anomalies Detector developed at MyHeritage and provides a detailed explanation of how to enhance the code to meet your company’s needs.

Analytics Across the Enterprise: How IBM Realizes Business Value from Big Data and Analytics

Posted by Alex Giamas on  Oct 27, 2014

"Analytics Across the Enterprise" book is a collection of experiences by analytics practitioners in IBM. InfoQ spoke with authors about lessons learned and IBM technologies in the Big Data area.

Real-Time Stream Processing as Game Changer in a Big Data World with Hadoop and Data Warehouse

Posted by Kai Wähner on  Sep 10, 2014

This article discusses what stream processing is, how it fits into a big data architecture with Hadoop and a data warehouse (DWH), and what technologies and products you can choose from. 6

Nikita Ivanov on GridGain’s In-Memory Accelerator for Hadoop

Posted by Srini Penchikala on  Sep 08, 2014

GridGain announced In-Memory Accelerator for Hadoop, offering benefits of in-memory computing to Hadoop applications. InfoQ spoke with Nikita Ivanov from GridGain about the product's architecture.

Introducing Spring XD, a Runtime Environment for Big Data Applications

Posted by Charles Humble on  Jul 23, 2014

Spring XD (eXtreme Data) is Pivotal’s Big Data play. It joins Spring Boot and Grails as part of the execution portion of the Spring IO platform. 1

MLConf NYC 2014 Highlights

Posted by Charles Menguy on  Apr 17, 2014

The MLConf conference was going strong in NYC on April 11th and was a full day packed with talks around Machine Learning and Big Data, featuring speakers from many prominent companies.

General Feedback
Bugs
Advertising
Editorial
Marketing
InfoQ.com and all content copyright © 2006-2015 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT