Facilitating the Spread of Knowledge and Innovation in Professional Software Development

Write for InfoQ


Choose your language

InfoQ Homepage Presentations Hypertable - An Open Source, High Performance, Scalable Database

Hypertable - An Open Source, High Performance, Scalable Database



This presentation discusses Hypertable, an open source, high performance, distributed database modeled after Google's Bigtable. Doug discusses the differences between Hypertable and traditional database technology, support for massive sparse tables, scaling to petabytes size, and how Hypertable is designed to run on top of an existing distributed file system, such as the Hadoop DFS.


At Inktomi's Web Search division Doug designed and developed large-scale distributed systems. Later, at Kosmix, Inc., he built a distributed web crawler and scaled it to a billion documents. He also worked at Verity, helping to develop VDK. He is currently Principal Search Architect at Zvents, Inc., and holds four patents in search technology.

About the conference

QCon is a conference that is organized by the community, for the community.The result is a high quality conference experience where a tremendous amount of attention and investment has gone into having the best content on the most important topics presented by the leaders in our community. QCon is designed with the technical depth and enterprise focus of interest to technical team leads, architects, and project managers.

Recorded at:

Jul 31, 2009

Hello stranger!

You need to Register an InfoQ account or or login to post comments. But there's so much more behind being registered.

Get the most out of the InfoQ experience.

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Community comments

  • GPL issue

    by Dorel Vaida,

    Your message is awaiting moderation. Thank you for participating in the discussion.

    They are aware that GPL forces the users of the package to also GPL their product, and so ... you need to provide a dual license model if you want to get any users that wouldn't want to publish their code ? And I'm pretty sure that this is the case with a very big percent of the users.

  • Re: GPL issue

    by Cd MaN,

    Your message is awaiting moderation. Thank you for participating in the discussion.

    IANAL, but AFAIK the GPL doesn't force you to give your source code to your client, unless you deliver binaries to them. Ie. if you only create a website, or your service is exposed trough HTTP (HTTPS hopefully), you don't need to give your clients the source code. This is true for both GPLv2 and GPLv3.

    There is a variation of the GPL called the AGPL (AGPLv2 and AGPLv3 more precisely), which does make it obligatory to deliver source code, even if you only expose your software trough a network service, but it is not as widely used as GPL.

  • Apache Cassandra

    by Paul Fremantle,

    Your message is awaiting moderation. Thank you for participating in the discussion.

    There is another similar project called Apache Cassandra. Its still in the incubator. It offers the same BigTable like data model and uses the same Eventually Consistent model from Amazon's Dynamo. It is Apache licensed, which makes life much simpler for most people consuming it.

    Here is a presentation from OSCON by Jonathan Ellis:


Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p

Allowed html: a,b,br,blockquote,i,li,pre,u,ul,p