"Fossies" - the Fresh Open Source Software Archive

Contents of tika-app-1.24.1.jar (21 Apr 20:40, 74859011 Bytes)

About: Apache Tika is a content analysis toolkit that detects and extracts metadata and structured text content from various documents. Runnable jar file.



Fossies downloads: /linux/misc/tika-app-1.24.1.jar  (tar.gz|tar.bz2|tar.xz)
Fossies services: CLOC analysis
Original URL: https://downloads.apache.org/tika/tika-app-1.24.1.jar
Home page: https://tika.apache.org/
VirusTotal check: Ok

Archive:  tika-app-1.24.1.jar
  Length      Date    Time    Name
---------  ---------- -----   ----
Basic infos (README, FAQ, INSTALL, ChangeLog, ...):
    10363  2016-10-07 11:20   LICENSE
      243  2013-12-21 08:22   mchange-config-resource-paths.txt
      773  2016-10-07 11:20   README.md
       18  2020-04-10 17:00   version2.txt
    27663  2020-04-17 17:18   META-INF/DEPENDENCIES
    11560  2009-12-09 13:02   license/LICENSE
    60653  2020-04-17 17:18   META-INF/LICENSE
      180  2020-04-17 17:18   META-INF/NOTICE
      895  2009-12-09 13:02   license/NOTICE
     2287  2009-12-09 13:02   license/README.dom.txt
      713  2009-12-09 13:02   license/README.sax.txt

Basic docs (manual pages, PDF-,HTML-,/doc/-files, ...):
     4010  2009-12-09 13:02   license/LICENSE.dom-documentation.txt
     3759  2020-04-17 17:18   org/jdom2/DocType.class
    12604  2020-04-17 17:18   org/jdom2/Document.class
     2059  2020-04-17 17:18   opennlp/tools/doccat/BagOfWordsFeatureGenerator.class
     3285  2020-04-17 17:18   opennlp/tools/doccat/DoccatCrossValidator.class
      320  2020-04-17 17:18   opennlp/tools/doccat/DoccatEvaluationMonitor.class
     4388  2020-04-17 17:18   opennlp/tools/doccat/DoccatFactory.class
     3096  2020-04-17 17:18   opennlp/tools/doccat/DoccatModel.class
      923  2020-02-03 20:51   schemaorg_apache_xmlbeans/system/sD023D6490046BA0250A839A9AD24C443/document2bd9doctype.xsb
      430  2019-03-22 21:28   schemaorg_apache_xmlbeans/system/sXMLSCHEMA/documentation6cdbdoctype.xsb
      629  2019-03-22 21:28   schemaorg_apache_xmlbeans/system/sXMLSCHEMA/documentationa475elemtype.xsb
      153  2019-03-22 21:28   schemaorg_apache_xmlbeans/system/sXMLSCHEMA/documentationelement.xsb
      450  2020-02-03 20:52   schemaorg_apache_xmlbeans/system/s8C3F193EE11A2F798ACF65489B9E6078/documentationreferencestype4bbetype.xsb
       61  2019-03-22 21:28   schemaorg_apache_xmlbeans/element/http_3A_2F_2Fwww_2Ew3_2Eorg_2F2001_2FXMLSchema/documentation.xsb
      858  2020-04-17 17:18   opennlp/tools/doccat/DocumentCategorizer.class
     1714  2020-04-17 17:18   opennlp/tools/doccat/DocumentCategorizerContextGenerator.class
     2458  2020-04-17 17:18   opennlp/tools/doccat/DocumentCategorizerEvaluator.class
     1991  2020-04-17 17:18   opennlp/tools/doccat/DocumentCategorizerEventStream$1.class
     2386  2020-04-17 17:18   opennlp/tools/doccat/DocumentCategorizerEventStream.class
     6032  2020-04-17 17:18   opennlp/tools/doccat/DocumentCategorizerME.class
     2059  2020-04-17 17:18   org/w3c/dom/Document.class
     8775  2020-04-17 17:18   org/jsoup/nodes/Document.class
      634  2020-02-03 20:51   schemaorg_apache_xmlbeans/system/sD023D6490046BA0250A839A9AD24C443/documentelement.xsb
      104  2020-04-17 17:18   org/w3c/dom/DocumentFragment.class
      207  2020-04-17 17:18   opennlp/tools/namefind/DocumentNameFinder.class
     3253  2020-04-17 17:18   opennlp/tools/doccat/DocumentSample.class
     1911  2020-04-17 17:18   opennlp/tools/doccat/DocumentSampleStream.class
     2649  2020-04-17 17:18   opennlp/tools/formats/DocumentSampleStreamFactory.class
     2306  2020-02-03 20:51   schemaorg_apache_xmlbeans/system/sD023D6490046BA0250A839A9AD24C443/documentsettingstype945btype.xsb
     1314  2020-02-03 20:51   schemaorg_apache_xmlbeans/system/sD023D6490046BA0250A839A9AD24C443/documentsheettype5ca7type.xsb
      289  2020-04-17 17:18   org/w3c/dom/DocumentType.class
     3644  2020-04-17 17:18   org/jsoup/nodes/DocumentType.class
       84  2020-02-03 20:51   schemaorg_apache_xmlbeans/element/URI_SHA_1_19646AEC388215C989FB75EDE3F402FF063BA490/document.xsb
      360  2020-04-17 17:18   opennlp/tools/doccat/FeatureGenerator.class
     9165  2020-04-17 17:18   org/cyberneko/html/HTMLElements.class
     2907  2020-04-17 17:18   org/cyberneko/html/HTMLElements$Element.class
      950  2020-04-17 17:18   org/cyberneko/html/HTMLElements$ElementList.class
    22099  2020-04-17 17:18   org/cyberneko/html/HTMLTagBalancer.class
     1197  2020-04-17 17:18   org/cyberneko/html/HTMLTagBalancer$ElementEntry.class
     2306  2020-04-17 17:18   org/cyberneko/html/HTMLTagBalancer$Info.class
     1604  2020-04-17 17:18   org/cyberneko/html/HTMLTagBalancer$InfoStack.class
     2227  2020-04-17 17:18   opennlp/tools/doccat/NGramFeatureGenerator.class

First 50 (of 45921) other files:
    11785  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.128.table
    12795  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.129.table
     1543  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.130.table
     3965  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.131.table
      542  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.132.table
     5970  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.133.table
     3233  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.140.table
     1202  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.150.table
     3981  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.151.table
     5478  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.160.table
     6608  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.162.table
     1479  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.170.table
    10587  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.171.table
     1809  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.172.table
     2153  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.173.table
     2557  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.174.table
     1408  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.175.table
     1377  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.180.table
     1582  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.190.table
    13338  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.200.table
     4250  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.201.table
    13011  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.210.table
    10794  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.211.table
     8394  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.212.table
      347  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.213.table
     3908  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.214.table
    13775  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.215.table
     8394  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.216.table
      152  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.220.table
     3792  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.228.table
     2786  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.230.table
      280  2015-03-06 15:38   resources/grib1/ecmwfGribApi/2.98.234.table
    19451  2015-03-06 15:38   resources/thredds/4p2to4p3Remap.xml
     7149  2020-02-20 17:20   org/apache/fontbox/cmap/83pv-RKSJ-H
     6070  2020-02-20 17:20   org/apache/fontbox/cmap/90msp-RKSJ-H
     4281  2020-02-20 17:20   org/apache/fontbox/cmap/90msp-RKSJ-V
     6139  2020-02-20 17:20   org/apache/fontbox/cmap/90ms-RKSJ-H
     4298  2020-02-20 17:20   org/apache/fontbox/cmap/90ms-RKSJ-V
     7884  2020-02-20 17:20   org/apache/fontbox/cmap/90pv-RKSJ-H
     3784  2020-02-20 17:20   org/apache/fontbox/cmap/90pv-RKSJ-V
      164  2020-04-17 17:18   org/bouncycastle/operator/AADProcessor.class
      238  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$1.class
     2442  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$AbstractTreeNode.class
     8575  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet.class
      854  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$Node.class
     4496  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$SubSet.class
      962  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$SubTreeIterator.class
     2345  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$TerminalNode.class
     3007  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$TreeIterator.class
     2047  2020-04-17 17:18   net/sf/ehcache/store/disk/ods/AATreeSet$TreeNode.class
...
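
The table above uses the column layout of `unzip -l` (length, date, time, name). As a rough sketch, a similar listing could be produced locally with Python's standard `zipfile` module, since a jar is a zip archive. The function name `list_members` and the `limit` parameter are illustrative assumptions, not part of Fossies or Tika.

```python
import zipfile

def list_members(jar_path, limit=None):
    """Yield 'length  date time   name' lines for the members of a jar/zip file.

    Hypothetical helper for illustration; the spacing only approximates the
    listing above.
    """
    with zipfile.ZipFile(jar_path) as zf:
        for index, info in enumerate(zf.infolist()):
            if limit is not None and index >= limit:
                break
            # date_time is (year, month, day, hour, minute, second)
            year, month, day, hour, minute, _second = info.date_time
            yield (
                f"{info.file_size:9d}  "
                f"{year:04d}-{month:02d}-{day:02d} {hour:02d}:{minute:02d}   "
                f"{info.filename}"
            )
```

Calling `list_members("tika-app-1.24.1.jar", limit=50)` would yield the first 50 members in archive order, which need not match the docs-related ordering shown on this page.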

Note: To limit the size of this page, a total of 45871 archive member files, probably not information or documentation related, are omitted here. All of them can be found in the complete docs-related index file, or in the index files sorted by original order, date, pathname, filename, or file extension (each index file is roughly 10.7 MB).
   MD5 (tika-app-1.24.1.jar): f53a6d48da81539a113d7376f383ead4
  SHA1 (tika-app-1.24.1.jar): 084fcd4b6d5425e2b65ce867f0a2474aedc2fcc0
SHA256 (tika-app-1.24.1.jar): e56d2e38be4755c78b511f316bda2a55af5c3b3b36e7e5536d3584c71239b187
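
The digests above can be used to check a downloaded copy of the jar. The following is a minimal sketch using Python's standard `hashlib`; the helper names `file_digest` and `matches` are assumptions made for illustration, and the expected value is the SHA256 digest copied from the line above.

```python
import hashlib

# SHA256 digest as published in the listing above for tika-app-1.24.1.jar.
EXPECTED_SHA256 = "e56d2e38be4755c78b511f316bda2a55af5c3b3b36e7e5536d3584c71239b187"

def file_digest(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks to bound memory use."""
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()

def matches(path, expected=EXPECTED_SHA256, algorithm="sha256"):
    """Return True if the file's digest equals the expected hex string."""
    return file_digest(path, algorithm) == expected.strip().lower()
```

For example, `matches("tika-app-1.24.1.jar")` should return `True` for an intact download. Passing `algorithm="md5"` or `algorithm="sha1"` together with the corresponding value above checks the other digests, although SHA256 is the stronger of the three.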
