BT

Your opinion matters! Please fill in the InfoQ Survey!

Managing Hadoop with Apache Ambari

| by Boris Lublinsky Follow 0 Followers on Dec 19, 2012. Estimated reading time: 1 minute |

A note to our readers: As per your request we have developed a set of features that allow you to reduce the noise, while not losing sight of anything that is important. Get email and web notifications by choosing the topics you are interested in.

With the increasing popularity of Hadoop, the issue of a proper management platform is moving to the forefront of the current issues. There are already several commercial Hadoop management platforms, such as Cloudera Enterprise Manager, but Apache Ambari is the first open source implementation of such a system. It is a web-based tool for provisioning, managing, and monitoring Apache Hadoop clusters. Currently Ambari supports the majority of Hadoop components including: HDFS, MapReduce, Hive, Pig, HBase, Zookeper, Sqoop, HCatalog, etc.

In his new blog post, "Apache Amabri: Hadoop Operations, Innovtation, and Enterprise Readiness," Hortonworks Vice President of Corporate Strategy Shaun Connolly emphasizes the following main achievements of Ambari during this year:

  • Simplified cluster provisioning with a step-by-step installation wizard
  • Pre-configured key operational metrics for instant insight into the health of Hadoop Core (Hadoop Distributed File System and MapReduce) and related projects such as HBase, Hive and HCatalog
  • Visualization and analysis of job and task execution to gain a better view into dependencies and performance
  • A complete RESTful API for exposing monitoring information and integrating with existing operational tools
  • An intuitive user interface that makes viewing information and controlling a cluster easy and productive

Ambari leverages Ganglia for metrics collection and Nagios for system alerting and will send emails when the administrator’s attention is needed (e.g., a node goes down, remaining disk space is low, etc).

Additionally, Ambari supports Hadoop security by supporting installation of secure (Kerberos-based) Hadoop clusters, providing role-based user authentication, authorization, and auditing and integration with LDAP and Active Directory for user management.

Apache Ambari is currently one of the 6 top Open Source Hadoop management tools. According Connolly, Ambari is an important part of Hadoop ecosystem because “stability and ease of management are two key requirements for enterprise adoption of Hadoop”.

Rate this Article

Adoption Stage
Style

Hello stranger!

You need to Register an InfoQ account or or login to post comments. But there's so much more behind being registered.

Get the most out of the InfoQ experience.

Tell us what you think

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Email me replies to any of my messages in this thread

The article seems incomplete. by 臧 秀涛

Maybe "Conno" needs an introduction.

Ambari manager installed cluster by lsp net

Does Ambari could manage a hadoop cluster witch has benn installed ?

Apache Ambari by Sonam Gupta

Nice post! Apache Ambari enables system administrators to provision, manage and monitor a Hadoop cluster, and also to integrate Hadoop with the existing enterprise infrastructure. More at
www.youtube.com/watch?v=1jMR4cHBwZE "

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Email me replies to any of my messages in this thread

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Email me replies to any of my messages in this thread

3 Discuss

Login to InfoQ to interact with what matters most to you.


Recover your password...

Follow

Follow your favorite topics and editors

Quick overview of most important highlights in the industry and on the site.

Like

More signal, less noise

Build your own feed by choosing topics you want to read about and editors you want to hear from.

Notifications

Stay up-to-date

Set up your notifications and don't miss out on content that matters to you

BT