OCR-D / ocrd_anybaseocr

DFKI Layout Detection for OCR-D
Apache License 2.0
48 stars 12 forks source link

block segmentation: non-text classes and prebuilt models #81

Open bertsky opened 3 years ago

bertsky commented 3 years ago

In e941321a507ce9f4f6d6416117e441124605748a it seems 3 non-text classes arrived: ImageRegion, TableRegion and GraphicsRegion. However, the Config.NUM_CLASSES remained the same, and equally the provided block_segmentation_weights.h5 still have only 1+14 classes:

>>> import h5py
>>> f = h5py.File('block_segmentation_weights.h5', 'a')
>>> f['mrcnn_class_logits/mrcnn_class_logits/kernel:0']
<HDF5 dataset "kernel:0": shape (1024, 15), type "<f4">

So @khurramHashmi what am I to make of this? Did you forget to publish/upload your new model file?

I would really like to see images and tables detected here.