
Hadoop Jobs on GPU with ParallelX

by Charles Menguy on  Dec 26, 2013

The MapReduce paradigm is not always ideal when dealing with large computationally intensive algorithms. A small team of entrepreneurs is building a product called ParallelX to solve that bottleneck by harnessing the power of GPUs to give Hadoop jobs a significant boost.

A Survey and Interview on How Hadoop Is Used Today

by Boris Lublinsky on  Dec 12, 2013

This post presents the results of a Hortonworks survey of over 500 Hadoop Summit 2013 attendees on how they use Hadoop, and an interview with David McJannet on Hadoop trends today.

Open Source SQL-in-Hadoop Solutions: Where Are We?

by Michael Hausenblas on  Dec 10, 2013

With Facebook recently releasing Presto as open source, the already crowded SQL-in-Hadoop market just became a tad more intricate. A number of open source tools are competing for the attention of developers: Hortonworks’ Stinger initiative around Hive, Apache Drill, Apache Tajo, Cloudera’s Impala, Salesforce’s Phoenix (for HBase) and now Facebook’s Presto.

A Few Highlights from QConSF 2013 - Part 1 of 2

by Martin Monroe on  Nov 30, 2013

Each day of the 3-day conference, held in the inviting environs of the Hyatt, offered a jam-packed schedule of speakers, exhibits and activities that made for some difficult decisions about which tracks and events to attend.

Cascading 2.5 Supports Hadoop 2

by Boris Lublinsky on  Nov 19, 2013

The new version of Cascading, released this week, incorporates Hadoop 2 support and includes Cascading Lingual, an open source project that provides a comprehensive ANSI SQL interface for accessing Hadoop-based data.

YARN Brings New Capabilities To Hadoop

by Roopesh Shenoy on  Oct 23, 2013

Hadoop 2 is now generally available, with YARN bringing the ability to build data-processing applications that work natively in Hadoop. We spoke to Rohit Bakhshi, product manager at Hortonworks, about YARN and what it means for Hadoop users.

QuantCell Research Announces First Public Beta of their Java-Aware Big-Data Spreadsheet

by Victor Grazi on  Aug 21, 2013

Big Data analytics startup QuantCell Research has announced the release of the first public beta of what they are positioning as their "Big Data" spreadsheet.

Best Practices for Amazon EMR

by Boris Lublinsky on  Aug 16, 2013

In his new whitepaper, Best Practices for Amazon EMR, Parviz Deyhim outlines best practices for using Amazon EMR, including moving data to AWS; strategies for collecting, compressing and aggregating the data; and common architectural patterns for setting up and configuring Amazon EMR clusters for processing.

Concurrent Releases Pattern, a Machine Learning DSL for Hadoop

by Boris Lublinsky on  May 20, 2013

Concurrent, Inc., the enterprise Big Data application platform company, today announced Pattern, a machine learning DSL based on an industry standard called PMML, which allows analytics frameworks such as SAS, R, Microstrategy, Oracle, etc., to export predictive models and run them on Hadoop clusters.

Windows Azure Updated with Hadoop, HTML5/JS, CORS, PhoneGap, Mercurial and Dropbox

by Anand Narayanaswamy on  Mar 26, 2013

The recently released Windows Azure updates include support for a Hadoop service, HTML5/JS, CORS and PhoneGap, as well as Mercurial, Dropbox, CodePlex and Bitbucket deployment integration.

DataStax Brings Enterprise Security To Cassandra, Hadoop, Solr

by Roopesh Shenoy on  Mar 18, 2013

DataStax Enterprise 3.0 was announced last month with several enterprise security features for clusters using Cassandra, Hadoop and Solr. InfoQ caught up with Robin Schumacher, VP of Products at DataStax, to learn more.

Concurrent Releases Lingual, a SQL DSL for Hadoop

by Boris Lublinsky on  Feb 28, 2013

Concurrent, Inc., the enterprise Big Data application platform company, today announced Lingual, an open source project enabling fast and simple Big Data application development on Apache Hadoop using SQL.

Greenplum Pivotal HD Combines the Strengths of SQL and Hadoop

by Abel Avram on  Feb 27, 2013

EMC Greenplum has announced Pivotal HD, a new Hadoop distribution that includes a fully compliant SQL MPP database running on HDFS, described as “hundreds of times faster than Hive”.

Competition between Real-time Hadoop Implementations Heats Up

by Boris Lublinsky on  Feb 25, 2013

Hortonworks’ new Stinger initiative joins Apache Drill and Cloudera Impala in competition for the best real-time Hadoop implementation.

A Look at Oracle’s NoSQL Database

by Jonathan Allen on  Feb 08, 2013

Oracle’s key-value database, known simply as “Oracle NoSQL Database”, has hit version 2.0. Oracle NoSQL Database is essentially a distributed frontend for Berkeley DB, but it offers much more than that. Support for SQL queries, both absolute and eventual consistency, and the option to reduce storage space using Avro schemas set it apart.

InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.