"Fossies" - the Fresh Open Source Software Archive

Member "tika-1.21/tika-server/README.md" (14 May 2019, 2110 Bytes) of package /linux/misc/tika-1.21-src.zip:


As a special service "Fossies" has tried to format the requested source page into HTML format (assuming markdown format). Alternatively you can here view or download the uninterpreted source code file. A member file download can also be achieved by clicking within a package contents listing on the according byte size field. See also the last Fossies "Diffs" side-by-side code changes report for "README.md": 1.17-src_vs_1.18-src.

Apache Tika JAX-RS Server

https://issues.apache.org/jira/browse/TIKA-593

Running

$ java -jar tika-server/target/tika-server.jar --help
   usage: tikaserver
    -?,--help           this help message
    -h,--host <arg>     host name (default = localhost)
    -l,--log <arg>      request URI log level ('debug' or 'info')
    -p,--port <arg>     listen port (default = 9998)
    -s,--includeStack   whether or not to return a stack trace
                        if there is an exception during 'parse'

Running via Docker

Assuming you have Docker installed, you can build you own local image using the:

mvn dockerfile:build

The image will be named apache/tika with the tag being the version being built. For example, building Apache Tika Server 1.17 will result in an image of apache/tika-server:1.17

You can then run this image by executing the following, replacing 1.17 with your build version:

docker run -d -p 9998:9998 apache/tika-server:1.17

This will load Apache Tika Server and expose its interface on:

http://localhost:9998

Usage

Usage examples from command line with curl utility:

HTTP Return Codes

200 - Ok
204 - No content (for example when we are unpacking file without attachments)
415 - Unknown file type
422 - Unparsable document of known type (password protected documents and unsupported versions like Biff5 Excel)
500 - Internal error