BT
Older rss
  • Rich Reimer on SQL-on-Hadoop Databases and Splice Machine

    by Srini Penchikala on  Jun 19, 2014

    SQL-on-Hadoop technologies include a SQL layer or a SQL database over Hadoop. These solutions are becoming popular recently as they solve the data management issues of Hadoop and provide a scale-out alternative for traditional RDBMSs. InfoQ spoke with Rich Reimer, VP of Marketing and Product Management at Splice Machine about the architecture and data patterns for SQL in Hadoop databases.

  • What is Apache Tez?

    by Roopesh Shenoy on  Apr 25, 2014

    Apache Tez is a new distributed execution framework that is targeted to-wards data-processing applications on Hadoop. But what exactly is it? How does it work? In the presentation, “Apache Tez: Accelerating Hadoop Query Processing”, Bikas Saha and Arun Murthy discuss Tez’s design, highlight some of its features and share initial results obtained by making Hive use Tez instead of MapReduce.

  • MLConf NYC 2014 Highlights

    by Charles Menguy on  Apr 17, 2014

    The MLConf conference was going strong in NYC on April 11th and was a full day packed with talks around Machine Learning and Big Data, featuring speakers from many prominent companies.

Lambda Architecture: Design Simpler, Resilient, Maintainable and Scalable Big Data Solutions

Posted by Daniel Jebaraj on  Mar 12, 2014

Lambda Architecture proposes a simpler, elegant paradigm designed to process large amounts of data. In this article, author discusses Lambda Architecture with the help of a sample Java application. 5

Big Data Analytics for Security

Posted by Alvaro A. Cárdenas, Pratyusa K. Manadhata, Sreeranga P. Rajan on  Feb 11, 2014

In this article, authors discuss the role of big data and Hadoop in security analytics space and how to use MapReduce to process data for security analysis.

Building Applications With Hadoop

Posted by Roopesh Shenoy on  Jan 30, 2014

How to use various tools such as Apache Avro, Apache Crunch, Cloudera ML and the Cloudera Development Kit to build applications that use Hadoop.

Building a Real-time, Personalized Recommendation System with Kiji

Posted by Jon Natkins on  Dec 26, 2013

Jon Natkins explains in this article how to create a personalized recommendation system fed with large amounts of real-time data using Kiji, which leverages HBase, Avro, Map-Reduce and Scalding.

Costin Leau on Elasticsearch, BigData and Hadoop

Posted by Srini Penchikala on  Nov 15, 2013

Elasticsearch is an open source, distributed real-time search and analytics engine for the cloud. InfoQ spoke with Costin Leau about Elasticsearch and how it integrates with Hadoop and Big Data.

Spoilt for Choice – How to choose the right Big Data / Hadoop Platform?

Posted by Kai Wähner on  Jul 09, 2013

Although Hadoop is a set of an open source Apache (and now GitHub) projects, there are currently a large number of alternatives for installing a version of Hadoop and realizing big data processes. 3

Interview and Video Review: Working with Big Data: Infrastructure, Algorithms, and Visualizations

Posted by Aslan Brooke on  May 02, 2013

Paul Dix leads a practical exploration into Big Data in this video training series. The training focuses on the high level architecture while teaching practical usage skills and Ruby algorithms.

Hadoop Virtual Panel

Posted by Boris Lublinsky on  Nov 20, 2012

In this virtual panel, InfoQ talks to several Hadoop vendors and users about their views at current and future state of Hadoop.

Generating Avro Schemas from XML Schemas Using JAXB

Posted by Benjamin Fagin on  Mar 06, 2012

In his new article Benjamin Fagin discuss how to write XJC plugins and use this technique to generate AVRO schemes and marshaling classes directly from existing XSD files 8

General Feedback
Bugs
Advertising
Editorial
InfoQ.com and all content copyright © 2006-2014 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.
Privacy policy
BT