Public Repository

Last pushed: 2 years ago
Short Description
Apache Tika - a content analysis toolkit
Full Description

The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.

Available versions:

  • tika-server 1.9
Docker Pull Command