InfoQ Homepage Presentations Hypertable - An Open Source, High Performance, Scalable Database
Hypertable - An Open Source, High Performance, Scalable Database
Summary
This presentation discusses Hypertable, an open source, high performance, distributed database modeled after Google's Bigtable. Doug discusses the differences between Hypertable and traditional database technology, support for massive sparse tables, scaling to petabytes size, and how Hypertable is designed to run on top of an existing distributed file system, such as the Hadoop DFS.
Bio
At Inktomi's Web Search division Doug designed and developed large-scale distributed systems. Later, at Kosmix, Inc., he built a distributed web crawler and scaled it to a billion documents. He also worked at Verity, helping to develop VDK. He is currently Principal Search Architect at Zvents, Inc., and holds four patents in search technology.
About the conference
QCon is a conference that is organized by the community, for the community.The result is a high quality conference experience where a tremendous amount of attention and investment has gone into having the best content on the most important topics presented by the leaders in our community. QCon is designed with the technical depth and enterprise focus of interest to technical team leads, architects, and project managers.
Community comments
GPL issue
by Dorel Vaida,
Re: GPL issue
by Cd MaN,
Apache Cassandra
by Paul Fremantle,
GPL issue
by Dorel Vaida,
Your message is awaiting moderation. Thank you for participating in the discussion.
They are aware that GPL forces the users of the package to also GPL their product, and so ... you need to provide a dual license model if you want to get any users that wouldn't want to publish their code ? And I'm pretty sure that this is the case with a very big percent of the users.
Re: GPL issue
by Cd MaN,
Your message is awaiting moderation. Thank you for participating in the discussion.
IANAL, but AFAIK the GPL doesn't force you to give your source code to your client, unless you deliver binaries to them. Ie. if you only create a website, or your service is exposed trough HTTP (HTTPS hopefully), you don't need to give your clients the source code. This is true for both GPLv2 and GPLv3.
There is a variation of the GPL called the AGPL (AGPLv2 and AGPLv3 more precisely), which does make it obligatory to deliver source code, even if you only expose your software trough a network service, but it is not as widely used as GPL.
Apache Cassandra
by Paul Fremantle,
Your message is awaiting moderation. Thank you for participating in the discussion.
There is another similar project called Apache Cassandra. Its still in the incubator. It offers the same BigTable like data model and uses the same Eventually Consistent model from Amazon's Dynamo. It is Apache licensed, which makes life much simpler for most people consuming it.
Here is a presentation from OSCON by Jonathan Ellis:
bit.ly/1tujuf
Paul