slub / ocrd_kitodo

Docker integration of Kitodo.Production and OCR-D
MIT License
9 stars 6 forks source link

allow re-runs #17

Closed bertsky closed 2 years ago

bertsky commented 2 years ago

Sometimes, processing might fail due to temporary downtimes. Or bugs in tools which get fixed subsequently.

Regardless, it should be easy to re-run the same workflow on a workspace again. To that end, ocrd process offers --overwrite (as does ocrd workflow client process, and ocrd-make always uses it).

But what about badly written workflows or data imported from presentation – is overwrite always the right thing to do?

bertsky commented 2 years ago

Besides incremental processing (for which ocrd process --override does help), we also have the initial, unconditional step of ocrd-import of workspace (which is not possible if the METS already exists).

And we must now consider that residual data lives under either: