InfoQ

InfoQ

News

My Bookmarks

Login or Register to enable bookmarks for unlimited time.

The content has been bookmarked!

There was an error bookmarking this content! Please retry.

Clustering Lucene with TerraCotta

Posted by Rob Thornton on Nov 08, 2006

Sections
Development,
Architecture & Design
Topics
Clustering & Caching ,
Search ,
Java
Tags
Lucene ,
Terracotta

Engineers at TerraCotta have detailed a new way to cluster lucene, the popular text search library from Apache. Their method involves implementing the lucene RAMDirectory interface and using TerraCotta DSO to share the RAMDirectory across JVMs.

Lucene is a popular, open-source text search library. There are several existing strategies to clustering Lucene. Steve and Orion at TerraCotta noted that it was used in several of the products they use in house and so decided it would be a good test of their clustering software. They first tried to cluster the lucene IndexWriter directly but they ran into some problems and so switched to using RAMDirectory.

Orion describes the RAMDirectory approach as being straightforward:

As you can see, dealing with the index is pretty simple. This example code isn't really any different with clustering enabled than it is without clustering. In fact, turning clustering on and off is as simple as invoking java with or without a couple of Terracotta options.

Orion has provided some example code, which he admits is unpolished, and promises to post a cleaned up version along with instructions soon.

All clean by ARI ZILKA Posted
  1. Back to top

    All clean

    by ARI ZILKA

    Orion has posted a cleaned up version for others to take advantage of:
    orionl.blogspot.com/

Educational Content

Jesper Boeg on Priming Kanban

In this interview, Jesper Boeg, author of the new InfoQ book – Priming Kanban, discusses the keys to using Kanban effectively, and how to get started if you are currently using other approaches.

New-age Transactional Systems - Not Your Grandpa's OLTP

John Hugg discusses high volume transaction processing applications with high and low frequency profiles, and how VoltDB can be used for that purpose.

Cool Code

Kevlin Henney examines code samples to see what can be learned from them starting from the premise that one won’t write great code unless he knows how to read it.

Collaboration: At the Extremities of Extreme

Jason Ayers share the observations he made watching a team of developers collaborating in real time on the same code base, pushing XP, pair programming and continuous integration to their extremes.

Yesod Web Framework

Michael Snoyman presents Yesod, a web framework written in Haskell and containing a web server, templating, ORM, libraries (templating, gravatar, etc.).

Transactions without Transactions

Richard Kreuter and Kyle Banker on how to avoid classical RDBMS transactional systems by using compensation mechanisms, transactional messaging or transactional procedures.

Attila Szegedi on JVM and GC Performance Tuning at Twitter

Attila Szegedi talks about performance tuning Java and Scala programs at Twitter: how to approach GC problems, the importance of asynchronous I/O, when to use MySQL/Cassandra/Redis, and much more.

10 tips on how to prevent business value risk

One category of risk that project teams need to ensure they address is business value failure – delivering a product that fails to provide value for the business investor.