
Implementing Lucene Spatial Support
Lucene geospatial extension proposed in this article is based on a two level search – first level search is based on Cartesian Grid search and the second level implements shape specific spatial calculations

Lucene geospatial extension proposed in this article is based on a two level search – first level search is based on Cartesian Grid search and the second level implements shape specific spatial calculations
InfoQ interviewed Chris Mattman from Apache Tika, a text extraction and detection library, in the occasion of the 1.0 release and the publication of the "Tika in Action" book.
The Lucene PMC (Project Management Committee) has announced the availability of Apache Lucene 3.5.0 and Apache Solr 3.5.0. Lucene is a high-performance, full-featured text search library. Solr is a standalone search server that uses Lucene at its core for indexing and search.
After several years of development, the developers from NeoTechnology have released version 1.0 of Neo4j, a Java-based graph database which follows the property graph datamodel. InfoQ spoke with NeoTechnology COO Peter Neubauer to learn more about the current Neo4j release and what it offers to developers.
The Apache Software Foundation has released Lucene 2.9, the last minor release before 3.0. Amongst other features version 2.9 includes a number of performance improvements, and adds near real-time searching, native support for numeric range queries, and geospatial aware searching.

The article describes overall design and implementation of integrating Lucene search library with HBase back end. It describes integration architecture, implementation and HBase tables design

While it almost certainly remains the largest Ruby on Rails based site in the world, Twitter has gradually been moving more and more of its stack to the JVM. Last year the company announced that its back-end message queue had been re-written in Scala, and more recently it moved the search stack to Java, making Twitter search around three times faster.

Leandro Moreira shows how to implement a domain specific version of Google’s “Did you mean” feature based on the SpellChecker project in the Apache Lucene sandbox using thee alternative algorithms (Levenshtein, Jaro-Winkler and N-gram).
John Wang discusses LinkedIn real-time distributed search engine architecture and implementation details for People Search, Signal, Stream Indexing, Zoie, and Bobo.