model behaviour on text vs. non-text

@bertsky First, let me clarify that our binarization models were not trained exclusively on the DIBCO dataset. In the early stages, DIBCO was the only ground truth (GT) available to us, so we trained our first models on it. We then used those models to generate pseudo-labeled GT from the SBB datasets. To do this, I applied thresholding to the model output so that almost everything (every element in the document images) was binarized, and then used scaling and cropping to improve the binarization and to keep only the desired results from each document image. As a result, our training data ended up as a mix of the DIBCO dataset, which contains mostly text content, and pseudo-labeled SBB data, which includes non-text content as well.
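To make the pseudo-labeling idea concrete, here is a minimal sketch of how thresholding, scaling, and cropping could be combined. It is not the code we actually used: `dummy_predict` is a hypothetical stand-in for a trained model, and the function names, the order of the steps, the threshold, the scale factor, and the crop box are all assumptions chosen for illustration.

```python
import numpy as np


def dummy_predict(image):
    """Hypothetical stand-in for a trained binarization model.

    Returns a per-pixel foreground probability in [0, 1];
    darker pixels get a higher score.
    """
    return 1.0 - image.astype(np.float32) / 255.0


def rescale_nearest(image, factor):
    """Nearest-neighbour rescale of a 2D array using plain NumPy indexing."""
    h, w = image.shape
    new_h = max(1, round(h * factor))
    new_w = max(1, round(w * factor))
    rows = np.minimum((np.arange(new_h) / factor).astype(int), h - 1)
    cols = np.minimum((np.arange(new_w) / factor).astype(int), w - 1)
    return image[np.ix_(rows, cols)]


def pseudo_label(image, predict=dummy_predict, scale=1.0,
                 threshold=0.5, crop=None):
    """Produce an (image patch, binary label patch) pair.

    Assumed order of steps: rescale the page, predict, threshold the
    prediction, then crop to the region whose result should be kept.
    crop is (top, left, height, width) in the rescaled image.
    """
    scaled = rescale_nearest(image, scale)
    prob = predict(scaled)
    label = (prob >= threshold).astype(np.uint8)  # 1 = foreground
    if crop is not None:
        top, left, height, width = crop
        scaled = scaled[top:top + height, left:left + width]
        label = label[top:top + height, left:left + width]
    return scaled, label


# Toy page: white background with one dark horizontal bar.
page = np.full((8, 8), 255, dtype=np.uint8)
page[2:4, 1:7] = 20

patch, label = pseudo_label(page, scale=2.0, threshold=0.5,
                            crop=(2, 0, 8, 16))
```

In a real setup the scale factor and crop box would be chosen per document, so that only the regions where the model's output looks acceptable end up in the pseudo-labeled GT.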

qurator-spk / sbb_binarization
