MapReduce Content on InfoQ

News

Hadoop Jobs on GPU with ParallelX

The MapReduce paradigm is not always ideal when dealing with large computationally intensive algorithms. A small team of entrepreneurs is building a product called ParallelX to solve that bottleneck by harnessing the power of GPUs to give Hadoop jobs a significant boost.

Charles Menguy
on Dec 26, 2013
Apache Tez - a Generalization of the MapReduce Data Processing

A new Apache incubator project, Tez, generalizes the MapReduce paradigm to execute a complex DAG (directed acyclic graph) of tasks.

Boris Lublinsky
on Sep 20, 2013
QuantCell Research Announces First Public Beta of their Java-Aware Big-Data Spreadsheet

Big Data analytics startup QuantCell Research has announced the release of the first public beta of what they are positioning as their "Big Data" spreadsheet.

Victor Grazi
on Aug 21, 2013
Trends in the latest Technology Radar

ThoughtWorks's latest "Technology Radar" focuses on mobile, accessible analytics, simple architectures, reproducible environments, and data persistence done right.

Aslan Brooke
on Jan 18, 2013
Windows Azure Storage New Pricing Structure Revealed

Microsoft recently revealed a new pricing structure for Windows Azure Storage along with several improvements.

Anand Narayanaswamy
on Dec 11, 2012
LinkedIn Engineering Releases SenseiDB 1.0.0

LinkedIn engineering releases SenseiDB 1.0.0, a NoSQL database focused on high update rates and complex semi-structured search queries, already used in production by LinkedIn in its search-related pages (e.g. People/Company search).

Kostis Kapelonis
on Mar 19, 2012
MapReduce Patterns, Algorithms, and Use Cases

In his new article “MapReduce Patterns, Algorithms, and Use Cases”, Ilya Katsov gives a systematic view of the different MapReduce patterns, algorithms and techniques that can be found on the web or in scientific articles along with several practical use case studies.

Boris Lublinsky
on Feb 08, 2012
Apache Hadoop 1.0.0 Supports Kerberos Authentication, Apache HBase and RESTful API to HDFS

After six years of gestation, big data framework Apache Hadoop 1.0.0 was recently released. Core features in the release include Kerberos authentication, support for Apache HBase, and a RESTful API to HDFS. InfoQ spoke with Arun Murthy, VP of Apache Hadoop, about the new release.

Srini Penchikala
on Jan 13, 2012
Blog Sentiment Analysis Using NoSQL Techniques

Corporations are increasingly using social media to learn more about what their customers are saying about their products. This presents unique challenges as unstructured content needs analytic techniques to interpret the sentiment embodied in the blog posts. InfoQ caught up with Subramanian Kartik to learn more about the blog sentiment analysis project his team worked on.

Srini Penchikala
on Dec 28, 2011
HPCC Systems Launches Big Data Delivery Engine on EC2

HPCC Systems, which is part of LexisNexis, is launching its Thor Data Refinery Cluster on Amazon EC2 this week. HPCC Systems is an enterprise-grade, open source Big Data analytics technology platform capable of ingesting vast amounts of data, transforming, linking and indexing that data, with parallel processing power spread across the nodes.

Jean-Jacques Dubray
on Dec 01, 2011
Big Data: Evolution or Revolution?

Recently Steve Jones, from Cap Gemini, questioned whether NoSQL/Big Data is the panacea that some vendors would have us believe. He suggests that in some cases an in-memory RDBMS may well be the optimal solution and that approaches such as MapReduce could be too difficult to understand for typical IT departments. He concludes with a suggestion that sometimes Big Data may be a Big Con.

Mark Little
on Nov 13, 2011
Hortonworks Announces Hadoop Data Platform

Hortonworks, a company created in June 2011 by Yahoo! and Benchmark Capital, has announced the Technical Preview Program of its Data Platform based on Hadoop. The company employs many of the core Hadoop contributors and intends to provide support and training.

Abel Avram
on Nov 01, 2011
AWS Targets Scientific Community with New Resources for High Performance Computing

The Amazon Web Services (AWS) team announced a set of resources targeting the high performance computing needs of the scientific community. AWS specifically highlights its “spot pricing” market as a way to do cost-effective, massive-scale computing in the Amazon cloud environment.

Richard Seroter
on Sep 29, 2011
MapR Releases Commercial Distributions based on Hadoop

MapR Technologies released a big data toolkit based on Apache Hadoop, with its own distributed storage alternative to HDFS. The software is commercial, with both a free edition, M3, and a paid edition, M5. M5 includes snapshots and mirroring for data, Job Tracker recovery, and commercial support. MapR's M5 edition will form the basis of EMC Greenplum's upcoming HD Enterprise Edition.

Ron Bodkin
on Jul 07, 2011
Yahoo Hadoop Spinout Hortonworks Announces Plans

Yahoo spun out its core Hadoop team, forming a new company, Hortonworks. CEO Eric Baldeschwieler presented the company's vision of easing adoption of Hadoop and making core engineering improvements for availability, performance, and manageability. Hortonworks will sell support, training, and certification, primarily indirectly through partners.

Ron Bodkin
on Jun 29, 2011
