mattporritt / moodle-search_elastic

An Elasticsearch engine plugin for Moodle's Global Search
https://moodle.org/plugins/search_elastic
GNU General Public License v3.0
16 stars 13 forks source link

Tika file size limit #31

Closed mattporritt closed 7 years ago

mattporritt commented 7 years ago

It appears that Tika does not have a configuration based limit for the size of file that can be processed by the Tika service. Instead it seems to be limited by the Java memory for the Tika applicaiton. This is not ideal.

To give some control over the size of files submitted to Tika we need to add a user configuration option to this plugin. This configuration option will limit the size of the file sent to Tika. If a file is larger than this setting a file record in Elasticsearch will be created but the the file content will not be included in the index.

mattporritt commented 7 years ago

latest master has added this functionality for this