Norconex / importer

Norconex Importer is a Java library and command-line application meant to "parse" and "extract" content out of a file as plain text, whatever its format (HTML, PDF, Word, etc). In addition, it allows you to perform any manipulation on the extracted text before using it in your own service or application.
http://www.norconex.com/collectors/importer/
Apache License 2.0
33 stars 23 forks source link

ElasticCommitter - Restrict based on action verbs (add, update, delete) #126

Closed stejacob closed 3 weeks ago

stejacob commented 3 weeks ago

Hi,

We would like to restrict specific actions for Elastic committers based on the type of operation (add, update, or delete).

We have three committers, and we're aiming to implement the following restrictions:

  1. One committer should be restricted to only handle English documents based on language field containing value en.
  2. Another committer should be restricted to handle only French documents based on language field containing value fr.
  3. The third committer should be restricted to perform delete operations only.

However, we're unsure if this can be achieved with the current configuration. At the moment, using the restrictTo option based on a field causes delete commands to be rejected, as the field doesn't exist in the document being deleted.

If this is not yet supported, would it be possible to open an enhancement request?

Any guidance on how to achieve this setup would be greatly appreciated.

Thank you.

Stephen Jacob