Apache Tika 1.0 Allows Easy Text Extraction for Java

by Fabian Lange on Dec 28, 2011

InfoQ interviewed Chris Mattmann of Apache Tika, a text extraction and detection library, on the occasion of the 1.0 release and the publication of the book "Tika in Action".