Hadoop-as-a-Service Provider Qubole Now Runs on Google Compute Engine

by Michael Hausenblas on  Dec 28, 2013

Qubole, a managed Hadoop-as-a-Service offering, is now available on Google Compute Engine (GCE). Until now Qubole was available only on Amazon's AWS; the announcement comes just a few days after Google released GCE into general availability.

Hadoop Jobs on GPU with ParallelX

by Charles Menguy on  Dec 26, 2013

The MapReduce paradigm is not always ideal for large, computationally intensive algorithms. A small team of entrepreneurs is building a product called ParallelX to address that bottleneck by harnessing the power of GPUs to give Hadoop jobs a significant boost.

Elastic Mesos service automates Mesos cluster deployment in EC2

by Charles Menguy on  Dec 17, 2013

EC2 users can now automate the deployment of Apache Mesos, an open-source tool to share cluster resources between multiple data processing frameworks, at scale through a new web service called Elastic Mesos provided by Big Data startup Mesosphere.

Martin Fowler on Data Austerity

by Jonathan Allen on  Dec 17, 2013

Martin Fowler writes about the opposite of Big Data, Datensparsamkeit. This German word roughly translates to “data austerity” or simply “not storing more than you need”.

A Survey and Interview on How Hadoop Is Used Today

by Boris Lublinsky on  Dec 12, 2013

This post presents the results of a Hortonworks survey of over 500 Hadoop Summit 2013 attendees on how they use Hadoop, and an interview with David McJannet on Hadoop trends today.

Big Data at Netflix Drives Business Decisions

by Alex Giamas on  Dec 12, 2013

Jeff Magnusson of the Netflix team gave a presentation at the QCon SF 2013 conference about their Data Platform as a Service. Following up on this presentation, we look at the technology stack and how it helps Netflix make important business decisions.

Open Source SQL-in-Hadoop Solutions: Where Are We?

by Michael Hausenblas on  Dec 10, 2013

With Facebook recently releasing Presto as open source, the already crowded SQL-in-Hadoop market just became a tad more intricate. A number of open source tools are competing for the attention of developers: the Hortonworks Stinger initiative around Hive, Apache Drill, Apache Tajo, Cloudera’s Impala, Salesforce’s Phoenix (for HBase) and now Facebook’s Presto.

Amazon re:invent roundup

by Chris Swan on  Dec 02, 2013

Amazon announced a number of new services at the recent re:Invent conference in Las Vegas: Amazon WorkSpaces (desktop computing in the cloud); Identity and Access Management using SAML; Amazon AppStream (delivering streaming applications from the cloud); Amazon Kinesis (streaming Big Data); CloudTrail (capturing AWS API activity); Postgres support in RDS; and new EC2 instance types.

Increasing Pace of Change Drives Agile In Enterprise Applications

by Shane Hastie on  Nov 30, 2013

The pace of organizational change and technology adoption is increasing, which means that enterprise software development needs to find ways to keep up. The rise of big data is also driving the need to run many experiments and adapt rapidly. Blogger Matt Asay recently wrote about this in a post titled "Hey, Enterprise Developers! Get Agile Or Get Steamrollered".

Streaming Big Data With Amazon Kinesis

by Roopesh Shenoy on  Nov 25, 2013

Amazon recently announced Kinesis, a service that allows developers to stream large amounts of data from different sources and process it. The service is currently in limited preview.

Cascading 2.5 Supports Hadoop 2

by Boris Lublinsky on  Nov 19, 2013

The new version of Cascading, released this week, incorporates Hadoop 2 support and includes Cascading Lingual, an open source project that provides a comprehensive ANSI SQL interface for accessing Hadoop-based data.

Presto: Facebook’s Distributed SQL Query Engine

by Jonathan Allen on  Nov 12, 2013

Facebook has open-sourced Presto, their distributed SQL query engine. Presto uses a pipelined architecture rather than the Map/Reduce design found elsewhere. Presto has been in production since early this year, and Facebook has since “deployed in multiple geographical regions and [they] have successfully scaled a single cluster to 1,000 nodes”.

AnyPresence Soups up Enterprise MBaaS Platform - Part 1 of 2

by Martin Monroe on  Oct 30, 2013

Mobile Backend as a Service provider AnyPresence continues to hone its chops, launching the fifth update to its self-titled platform geared for the enterprise. Co-founder Rich Mendis provides some insights for InfoQ readers…

Introducing SQL Server 2014's New Clustered Columnstore Indexes

by Jonathan Allen on  Sep 26, 2013

SQL Server 2014 will offer Clustered Columnstore Indexes. These will offer the performance and compression benefits of column-oriented storage without the need to restrict the underlying table to read-only access.

Apache Tez - a Generalization of the MapReduce Data Processing

by Boris Lublinsky on  Sep 20, 2013

A new Apache incubator project, Tez, generalizes the MapReduce paradigm to execute a complex DAG (directed acyclic graph) of tasks.

InfoQ.com and all content copyright © 2006-2014 C4Media Inc.