InfoQ

News

Aster In-Database MapReduce

Posted by R.J. Lorimer on Sep 21, 2008 09:58 PM

Community
Java
Topics
Database Design ,
Fault Tolerance ,
Platforms ,
Cloud Computing ,
Enterprise Architecture
Tags
MapReduce ,
Database
Aster Data Systems recently announced Aster In-Database MapReduce, a component of their nCluster database.

MapReduce has been discussed at length at InfoQ, and is a programming model originally introduced by engineers at Google as a scalable approach to processing large data-sets.

The nCluster database is labeled by Aster as a massively parallel processing (MPP) database. The parallel architecture of nCluster is described on their website in this way:
Aster nCluster is built on a unique, multi-tiered nCluster architecture which consists of three separate classes of nodes: Queens, Workers, and Loaders. The three-tier design encapsulates a clean separation of roles for analytic processing. Each tier can be independently and incrementally scaled in response to the workload characteristics – adding more capacity (Workers), loading bandwidth (Loaders), or concurrency (Queens) on an as-needed basis.
The MapReduce implementation provided in Aster nCluster allows for the execution of MapReduce calculations within the database, using this same architecture:
Just like its massively parallel execution environment for standard SQL queries, Aster nCluster now adds the ability to implement flexible MapReduce functions for parallel data analysis and transformation inside the database. Aster nCluster In-Database MapReduce functions are simple to write and are seamlessly integrated within SQL statements. They rely on SQL queries to manipulate the underlying data and provide input. The functions can procedurally manipulate such input data and provide outputs that can be further consumed by SQL queries or be written into tables within the database.
SQL/MR is a special SQL MapReduce function library introduced by Aster that can be used to invoke map-reduce algorithms within the nCluster platform. Aster supports polymorphic functions and dynamic typing, and MapReduce calculations may be developed in languages such as Java, Python, C++ and others.

More information about In-Database Map Reduce and the nCluster database is available on the Aster Data Systems website.

No comments

Watch Thread Reply

Educational Content

Bindings, Platforms, and Innovation

This presentation focuses on the Internet and separating myth from fact, history from the future, and the mundane from the imaginative. Bob Frankston presents a vision of what could and should be.

Orchestrating Long Running Activities with JBoss / JBPM

This article explores the use of JBoss and jBPM to implement design solutions that effectively address the issue of orchestrating long running activities.

Neo4j - The Benefits of Graph Databases

This presentation covers the use of graph databases as an optimal solution for data that is difficult to fit in static tables, rapidly evolving data or data that has a lot of optional attributes.

Realistic about Risk: Software development with Real Options

This session introduces Real Options and shows how it can help in running your project. Real Options is a decision-making process that can be used to manage risk.

Communication Flexibility Using Bindings

This article discusses the use of bindings on services and references (including the instance of non-configured bindings) as the means to implement SCA communications in a Web and SOA environment.

Writing DSLs in Groovy

After a short introduction to DSLs, Scott Davis plays with the keyboard showing how to approach the creation of a DSL by typing working snippets of Groovy code that get executed.

Scaling Agile with C/ALM (Collaborative Application Lifecycle Management)

IBM Rational and InfoQ present, Scaling Agile with C/ALM, an eBook showing organizations how to become “finely tuned software delivery machines” by enabling team integration and scaling.

Concurrent Programming with Microsoft F#

Amanda Laucher presents a real life enterprise application written in F#. She shows actual code snippets, explaining design decisions and suggesting how to use some of the F# constructs.