InfoQ

InfoQ

News

My Bookmarks

Login or Register to enable bookmarks for unlimited time.

The content has been bookmarked!

There was an error bookmarking this content! Please retry.

Cloudera Enterprise Released: Interview with Charles Zedlewski

Posted by Tim Cull on Aug 24, 2010

Sections
Architecture & Design,
Development,
Operations & Infrastructure
Topics
Java ,
Cloud Computing ,
Deployment / Datacenter
Tags
Cloudera ,
Hadoop

Cloudera announced at the end of June that it has released Cloudera Enterprise, a commercial packaging of the open-source project Apache Hadoop and many other supporting projects.  The intent behind Cloudera Enterprise is to expand Hadoop's reach further into the corporate world. 

Hadoop and its sub-projects like Hive and HDFS have been covered extensively here on InfoQ.  For the latest update on Cloudera as a commercial packaging of those projects, InfoQ caught up with Charles Zedlewski, the Senior Director of Product Management at Cloudera:

This is for companies that want to run Hadoop in production.  So you've got the first use case to work, yeah it's great Hadoop does everything that people said it would.  Now you've got a second use case and a third and a fourth.  This is starting to take over fairly serious workloads inside my business....for a typical enterprise their mindset is ah, ok, now this Hadoop will be a part of my datacenter for some time to come. There's a whole bunch of things a company expects from a platform that will live for any protracted period of time.  That's really what [Cloudera] Enterprise is for.

We asked Mr. Zedlewski to describe some of these expectations. He suggested that enterprise operators:

...need some consistent behavior out of this platform: I need to be able to tie it to some SLAs.  I need it to integrate with and snap into the other tools and technologies I own and fit in with the other IT policies I have.  I need to be able to administrate it and it can't be that I have to hire somebody out of Facebook or Google to get it working.  It needs to be that with training I can get it working with the people I have on staff today.

Zedlewski went on to describe the various tools for analysts that allow them "to work in the tool they already know but just point it at this big processing engine called Hadoop."  The list included current and future Cloudera partners like Karmasphere, Datamere, MicroStrategy, Quest Software , and "a ton more."

Zedlewski's view is that in the future the majority of users will work with Hadoop through these sorts of tools rather than directly:

...at Facebook, Yahoo! and even Twitter, currently 90% of the workload that goes into Hadoop is no longer originating in MapReduce.  Instead users originate with higher level things like Pig or Hive which are compiling into MapReduce.  MapReduce is like the [operating system] kernel everyone uses but doesn't interact with any more.

However, Mr. Zedlewski went on to explain that there will not be a one-size-fits-all interface to Hadoop, saying, "the idea that there will be one UI that everyone passes through to get to Hadoop would be a big mistake.  If you think about databases there's not one UI that everyone passes through to do things with databases and neither will that be true for Hadoop."

Before signing off, we asked Mr. Zedlewski for a preview of what may be coming in advance of Hadoop World in October and he wrapped up with: "our current plan is to do one final Beta update to our distribution for Hadoop, so we'll have a number of significant enhancements to talk about...On the enterprise side, it will be principally a preview."

No comments

Watch Thread Reply

Educational Content

New-age Transactional Systems - Not Your Grandpa's OLTP

John Hugg discusses high volume transaction processing applications with high and low frequency profiles, and how VoltDB can be used for that purpose.

Cool Code

Kevlin Henney examines code samples to see what can be learned from them starting from the premise that one won’t write great code unless he knows how to read it.

Collaboration: At the Extremities of Extreme

Jason Ayers share the observations he made watching a team of developers collaborating in real time on the same code base, pushing XP, pair programming and continuous integration to their extremes.

Yesod Web Framework

Michael Snoyman presents Yesod, a web framework written in Haskell and containing a web server, templating, ORM, libraries (templating, gravatar, etc.).

Transactions without Transactions

Richard Kreuter and Kyle Banker on how to avoid classical RDBMS transactional systems by using compensation mechanisms, transactional messaging or transactional procedures.

Attila Szegedi on JVM and GC Performance Tuning at Twitter

Attila Szegedi talks about performance tuning Java and Scala programs at Twitter: how to approach GC problems, the importance of asynchronous I/O, when to use MySQL/Cassandra/Redis, and much more.

10 tips on how to prevent business value risk

One category of risk that project teams need to ensure they address is business value failure – delivering a product that fails to provide value for the business investor.

Interview: Software Systems Architecture: Working With Stakeholders Using Viewpoints and Perspectives

InfoQ spoke to the authors of Software Systems Architecture on a couple of new topics, the System Context viewpoint and Agile, which have been added to the second edition.