Closed PonteIneptique closed 3 years ago
BTW, if you want to test anything regarding this, we got a dataset there: https://github.com/HTR-United/cremma-medieval
- What's the best way to deal with
No boundary
, from a user end perspective ?
Fix the bug in escriptorium. ;)
- Could repolygonize profit from multithreading (right now it's not parallelized, the easiest way would probably be there: https://github.com/mittagessen/kraken/blob/0f6bfd21f60c6dbb39e86c56474b052d029cf332/kraken/lib/dataset.py#L299 ? )
Hmm, I could move it from the XML parser to the dataset.
- If we have repolygonization, could we save it somehow (connected to 1) just to avoid redoing it for the next training ?
There is (was actually) already a script in contrib/ called
repolygonize.py
that did exactly that. I've committed a broken version
while rewriting it but will fix it before the end of the week.
Hello, if you want polygons be sure to click on the green button in the segmentation panel ("Segmentation is ready for mask calculation!"), the quality of polygons is a lot better once all the lines are drawn and we can't really guess when it is the case which is why this not automatic. You only need to do it once on a page, then they are recalculated automatically if need be. If you import data without polygons you can batch it by selecting your images and choosing 'Only line masks' in the segment form. Hope it helps.
if you want polygons be sure to click on the green button in the segmentation panel ("Segmentation is ready for mask calculation!")
I might be stupid, but can you screenshot me where this is ? :)
There is (was actually) already a script in contrib/ called
repolygonize.py
that did exactly that. I've committed a broken version while rewriting it but will fix it before the end of the week.
I'll close the issue when this is out :D
the green thumbs up button above the segmentation panel (2)
Well, we're gonna continue this conversation on segmentation on Gitlab, because we got no such things ?
it is indeed visible only if no line has a polygon. what were your previous actions on this page?
Hey @mittagessen, Currently, it seems that some export from eScriptorium are missing masks or at least something that is preventing the use of all data directly by Kraken. From what I understand, in this context,
--repolygonize
is the way to go.My question is in two/three parts:
No boundary
, from a user end perspective ?