InfoQ

Columnar Databases Content on InfoQ


Latest featured content about Columnar Databases

Implementing Lucene Spatial Support

Topics
HBase,
Big Data,
Database Design,
Columnar Databases,
Database,
Lucene,
Search

The Lucene geospatial extension proposed in this article is based on a two-level search: the first level is a Cartesian grid search, and the second level implements shape-specific spatial calculations.
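The two-level idea can be illustrated with a minimal sketch. This is not the article's implementation and does not use Lucene: the grid resolution `CELL_SIZE`, the helper names, the sample points, and the use of planar Euclidean distance (rather than a geodesic calculation) are all assumptions made for illustration.

```python
import math
from collections import defaultdict

CELL_SIZE = 1.0  # hypothetical grid resolution, in the same units as the coordinates


def cell_of(x, y):
    """Map a point to the (column, row) of the grid cell that contains it."""
    return (math.floor(x / CELL_SIZE), math.floor(y / CELL_SIZE))


def build_index(points):
    """Group point ids by grid cell; `points` maps id -> (x, y)."""
    index = defaultdict(list)
    for point_id, (x, y) in points.items():
        index[cell_of(x, y)].append(point_id)
    return index


def search_circle(index, points, cx, cy, radius):
    """Return the sorted ids of points within `radius` of (cx, cy)."""
    # Level 1: collect candidates from every cell touching the circle's bounding box.
    min_col, min_row = cell_of(cx - radius, cy - radius)
    max_col, max_row = cell_of(cx + radius, cy + radius)
    candidates = []
    for col in range(min_col, max_col + 1):
        for row in range(min_row, max_row + 1):
            candidates.extend(index.get((col, row), []))
    # Level 2: shape-specific check, here an exact planar distance test.
    return sorted(
        pid for pid in candidates
        if math.hypot(points[pid][0] - cx, points[pid][1] - cy) <= radius
    )


# Hypothetical sample data.
points = {"a": (0.5, 0.5), "b": (1.4, 0.6), "c": (1.9, 1.9), "d": (5.0, 5.0)}
index = build_index(points)
print(search_circle(index, points, 1.0, 1.0, 1.0))
```

In this sketch the grid lookup cheaply discards far-away points such as `d`, while the exact distance test removes candidates such as `c` that share a nearby cell but fall outside the circle.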

News about Columnar Databases

Apache Hadoop 1.0.0 Supports Kerberos Authentication, Apache HBase and RESTful API to HDFS

Topics
HBase,
Big Data,
NoSQL,
Database Design,
Columnar Databases,
Hadoop,
MapReduce,
Database,
Announcements

After six years of gestation, the big data framework Apache Hadoop 1.0.0 was recently released. Core features in the release include Kerberos authentication, support for Apache HBase, and a RESTful API to HDFS. InfoQ spoke with Arun Murthy, VP of Apache Hadoop, about the new release.

eBay readies next generation search built with Hadoop and HBase

Topics
HBase,
Big Data,
NoSQL,
Database Design,
Columnar Databases,
Database,
Hadoop,
Search

eBay presented a keynote at Hadoop World, describing the architecture of its completely rebuilt search engine, Cassini, slated to go live in 2012. It indexes all the content and user metadata to produce better rankings and refreshes indexes hourly. It is built using Hadoop for hourly index updates and HBase to provide random access to item information.

Yahoo Hadoop Spinout Hortonworks Announces Plans

Topics
Map-Reduce,
HBase,
Big Data,
Apache,
Open Source,
Database Design,
Columnar Databases,
Announcements,
Architecture,
Web Servers,
Database,
Hive,
Programming,
Hadoop,
Hortonworks

Yahoo spun out its core Hadoop team, forming a new company, Hortonworks. CEO Eric Baldeschwieler presented the company's vision of easing adoption of Hadoop and making core engineering improvements for availability, performance, and manageability. Hortonworks will sell support, training, and certification, primarily indirectly through partners.

Facebook on Hadoop, Hive, HBase, and A/B Testing

Topics
HBase,
Data Access,
Columnar Databases,
Database Design,
Deployment / Datacenter,
Database Management,
Operations,
Data Analysis,
Infrastructure,
Facebook,
Database,
Architecture,
Data Warehouse,
Data Warehousing,
Data Partitioning,
Performance & Scalability,
Data Visualization,
Hadoop,
Testing

The Hadoop Summit of 2010 included presentations from a number of large-scale users of Hadoop and related technologies. Notably, Facebook presented a keynote and detailed information about its use of Hive for analytics. Mike Schroepfer, Facebook's VP of Engineering, delivered a keynote describing the scale of the company's data processing with Hadoop.

Articles about Columnar Databases

Integrating Lucene with HBase

Topics
HBase,
Columnar Databases,
Database,
Search,
Lucene

The article describes the overall design and implementation of integrating the Lucene search library with an HBase back end. It covers the integration architecture, the implementation, and the design of the HBase tables.

Presentations about Columnar Databases

HBase @ Facebook

Topics
Messaging,
QCon London 2011,
Web Services,
Big Data,
HBase,
Database Design,
SOA,
QCon,
NoSQL,
Enterprise Architecture,
Columnar Databases,
Database,
Architecture,
Conferences,
Facebook

Kannan Muthukkaruppan gives an overview of HBase, explaining what Facebook Messages is, why Facebook chose HBase to implement it, the company's contribution to HBase, and what it plans to use HBase for in the future.

NoSQL at Twitter

Topics
HBase,
Strange Loop 2010,
NoSQL,
Strange Loop,
Data Analysis,
Columnar Databases,
Twitter,
Hadoop,
Architecture,
Database,
Conferences

Kevin Weil presents how Twitter does data analysis using Scribe for logging, base analysis with Pig/Hadoop, and specialized data analysis with HBase, Cassandra, and FlockDB.