jonaswinkler / paperless-ng

A supercharged version of paperless: scan, index and archive all your physical documents
https://paperless-ng.readthedocs.io/en/latest/
GNU General Public License v3.0
5.37k stars 358 forks source link

Trigger new rules for metadata via UI #429

Open Matthias84 opened 3 years ago

Matthias84 commented 3 years ago

If we create new rules, they currently get only applied to new documents, or if we trigger a full scan via the CLI. It would be nice, if we could apply a new rule to all documents. Even better, if we could delay the rescan, if we want to add more than just one new rule.

jonaswinkler commented 3 years ago

See https://paperless-ng.readthedocs.io/en/latest/administration.html#document-retagger,

also see https://github.com/jonaswinkler/paperless-ng/issues/264

MPatzek commented 3 years ago

I think having a button to run document_retagger or some other managment commands would be really nice. Logging into the console to run this is not really user-friendly.

igno2k commented 3 years ago

Does this also apply to the automatic tagging feature? It would be nice if documents in the inbox would be retagged with the automatic tagging feature as it gets trained by feeding more documents to it.

jonaswinkler commented 3 years ago

Yes, the retagger takes auto matching into account.

igno2k commented 3 years ago

Yes, the retagger takes auto matching into account.

Cool - thanks! It would be nice to have a button to start it in the GUI or maybe a task planned in the scheduler for it.

jonaswinkler commented 3 years ago

It would be nice to have a button to start it in the GUI

I know.

maybe a task planned in the scheduler for it.

That's actually a pretty neat idea. Have the re-tagger run on the inbox every day or so. That way, the inbox will dynamically update as you remove documents from the inbox and add correct metadata to them. This is actually possible with the current version, the configuration for this is just a little awkward.

rizz360 commented 2 years ago

This is actually possible with the current version, the configuration for this is just a little awkward.

How would I go about doing this?

document_retagger with the arguments alone in the scheduled tasks does not seem to work. What module to I have to specify?