InfoQ Homepage Apache Hadoop Content on InfoQ

News

RSS Feed

Newer Older

Project Myriad: Mesos and YARN Working Together

An article by Jin Scott - A tale of two clusters: Mesos and YARN – describes hardware silos created by using different resource managers on different hardware clusters, most popular being Mesos and Yarn and introduces Myriad – a solution allowing to run a YARN cluster on Mesos.

Boris Lublinsky
on Feb 14, 2015
EMRFS Brings Consistency to Amazon S3

Amazon recently announced EMRFS, an implementation of HDFS that allows EMR clusters to use S3 with a stronger consistency model. When enabled, this new feature keeps track of operations performed on S3 and provides list consistency, delete consistency and read-after-write-consistency, for any cluster created with Amazon Machine Image (AMI) version 3.2.1 or greater.

Jérôme Serrano
on Jan 27, 2015
LinkedIn Open Sources Cubert With an Eye To Big Data Analytics

LinkedIn recently open sourced Cubert, its High Performance Computation Engine for Complex Big Data Analytics. Cubert is a framework written for analysts and data scientists in mind.Developed completely in Java and expressed as a scripting language, Cubert is designed for complex joins and aggregations that frequently arise in the reporting world.

Alex Giamas
on Dec 17, 2014
Stripe Open Sources Tools For Apache Hadoop

Stripe, the internet payments infrastructure company recently announced open sourcing a set of internally developed tools based on Apache Hadoop.Timberlake, Brushfire, Sequins and Herringbone all contribute to enriching the available tools for building an Apache Hadoop stack.

Alex Giamas
on Dec 09, 2014
Microsoft Expands Azure Machine Learning and Real Time Analytics Offering

Microsoft recently announced new machine learning capabilities for Microsoft Azure platform. Developers can also create their own web services and publish them to Azure Marketplace. Microsoft also announced availability of Apache Storm for Azure. Azure Stream Analytics, Data Factory and Event Hubs for Azure were all announced in the past few weeks by Microsoft. In this article we explore moreabout

Alex Giamas
on Oct 31, 2014
Hortonworks Announces Stinger.next Roadmap to Deliver Hadoop Scale SQL with Apache Hive

Following on from the Stinger initiative delivered in Apache Hive 0.13, Hortonworks has laid out the Stinger.next roadmap to provide fully ACID transactions, a sub-second query engine, and more complete SQL 2011 analytics support, all driving towards the goal of “enhancing the speed, scale and breadth of SQL support” in Hive.

Adam Berry
on Sep 25, 2014
Hadoop Summit 2014 Day One - On the Path to Enterprise Grade Hadoop

Hadoop Summit Day One report covers the important trends and changes from last year's summit. It also covers the important announcements of the day in relation to this year's trending topics. This report focuses on the platform specific innovations and announcements and not the broader partner ecosystem, which will be covered in the next few days.

Jeevak Kasarkod
on Jun 04, 2014
Community the Focus at ApacheCON NA 2014

This year's ApacheCON North America conference saw key speakers focus on open source and its community. With more than 400 attendees, over 70 projects represented and 180 conference sessions it covered as many diverse topics as diverse the Apache Software Foundation projects are.

Carlos Sanchez
on May 15, 2014
Introducing Microsoft Avro

Microsoft has announced their implementation of the Apache Avro wire protocol. Avro is described a “compact binary data serialization format similar to Thrift or Protocol Buffers” with additional features needed for distributed processing environments such as Hadoop.

Jonathan Allen
on May 08, 2014
Coverity Scan Gets Better with Java, Apache Hadoop, HBase and Cassandra Support

The recently released open source scan report by Coverity mainly detected and fixed Resource Leaks, Null Pointer and Control Flow issues besides several other issues. It also scanned the source code of Linux and fixed several bugs.

Anand Narayanaswamy
on May 02, 2014
Cloudera Partners with MongoDB to Store Hadoop Data on Their NoSQL DB

Starting from the premise that today “80 percent of enterprise data is unstructured and growing at twice the rate of structured data”, Cloudera and MongoDB have announced a “strategic” partnership meant to provide customers the option to combine Cloudera’s Apache-based Big Data platform with MongoDB’s NoSQL solution.

Abel Avram
on Apr 29, 2014
Hadoop Gets Better Security, Several Operational Improvements

Hadoop 2.4.0 was recently released with several enhancements to both HDFS and YARN. This includes support for Access Control Lists, Native support for Rolling upgrades, Full HTTPS support for HDFS, Automatic failover of YARN and other operational improvements

Roopesh Shenoy
on Apr 18, 2014
Hydra Takes On Hadoop

The social-networking company AddThis open-sourced Hydra under the Apache version 2.0 License in a recent announcement. Hydra grew from an in-house platform created to process semi-structured social data as live streams and do efficient query processing on those data sets.

Rags Srinivas
on Apr 11, 2014
Spark Officially Graduates From Apache Incubator

Recently, Spark graduated from the Apache incubator. Spark claims up to 100x speed improvements over Apache Hadoop over in-memory datasets and gracefully falling back to 10x speed improvement for on-disk performance. Based on Scala, it can run SQL queries and be used directly in R. It provides Machine Learning, Graph database capabilities and other further discussed in the article.

Alex Giamas
on Feb 28, 2014
Interactive SQL in Apache Hadoop with Impala and Hive

In the race for interactive SQL in Big Data environments, there are two open source based front-runners, Impala and Hive with the Stinger project. Cloudera recently announced that Impala is up to 69 times faster than Hive 0.12 and can outperform DBMS. Other than raw speed, we take a look at other considerations in choosing a SQL engine for Hadoop and also Tez, an application framework for YARN.

Alex Giamas
on Feb 07, 2014

Newer News

Older News

InfoQ Software Architects' Newsletter

News