InfoQ

InfoQ

Topic/Tag specific view

Lucene Content on InfoQ


Latest featured content about Lucene

Implementing Lucene Spatial Support

Topics
Search,
Big Data

Lucene geospatial extension proposed in this article is based on a two level search – first level search is based on Cartesian Grid search and the second level implements shape specific spatial calculations

News about Lucene

Apache Tika 1.0 Allows Easy Text Extraction for Java

Topics
Java,
Apache

InfoQ interviewed Chris Mattman from Apache Tika, a text extraction and detection library, in the occasion of the 1.0 release and the publication of the "Tika in Action" book.

Lucene 3.5 and Solr 3.5 - Substantial RAM Reduction, SearcherManager, Deep Paging Support

Topics
Search,
Java

The Lucene PMC (Project Management Committee) has announced the availability of Apache Lucene 3.5.0 and Apache Solr 3.5.0. Lucene is a high-performance, full-featured text search library. Solr is a standalone search server that uses Lucene at its core for indexing and search.

Neo4j: Java-based NoSQL Graph Database

Topics
NoSQL,
Java,
Database Design

After several years of development, the developers from NeoTechnology have released version 1.0 of Neo4j, a Java-based graph database which follows the property graph datamodel. InfoQ spoke with NeoTechnology COO Peter Neubauer to learn more about the current Neo4j release and what it offers to developers.

Apache Lucene 2.9 Released

Topics
Search,
Java,
Open Source

The Apache Software Foundation has released Lucene 2.9, the last minor release before 3.0. Amongst other features version 2.9 includes a number of performance improvements, and adds near real-time searching, native support for numeric range queries, and geospatial aware searching.

Articles about Lucene

Integrating Lucene with HBase

Topics
Search

The article describes overall design and implementation of integrating Lucene search library with HBase back end. It describes integration architecture, implementation and HBase tables design

Twitter Shifting More Code to JVM, Citing Performance and Encapsulation As Primary Drivers

Topics
Object Oriented Design,
Java,
Performance & Scalability

While it almost certainly remains the largest Ruby on Rails based site in the world, Twitter has gradually been moving more and more of its stack to the JVM. Last year the company announced that its back-end message queue had been re-written in Scala, and more recently it moved the search stack to Java, making Twitter search around three times faster.

Implementing Google's "Did you mean" Feature In Java

Topics
Search,
Java,
Open Source

Leandro Moreira shows how to implement a domain specific version of Google’s “Did you mean” feature based on the SpellChecker project in the Apache Lucene sandbox using thee alternative algorithms (Levenshtein, Jaro-Winkler and N-gram).

Presentations about Lucene

LinkedIn Search: Searching the Social Graph in Real Time

Topics
Search,
Stories & Case Studies,
Real Time,
Architecture

John Wang discusses LinkedIn real-time distributed search engine architecture and implementation details for People Search, Signal, Stream Indexing, Zoie, and Bobo.