Apache Tika

 The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. 

Version List

Version OS Platform Size(MB) Release Date URL
1.4 All All 24.5 2013-06-17 00:00:00.0 http://get.jenv.mvnsearch.org/download/tika/tika-1.4.zip