options used for the command train of calamari OCR

Calamari-OCR / calamari

Line based ATR Engine based on OCRopy

Apache License 2.0

1.04k stars 209 forks source link

The parameters are already optimized for best results. You might be able to achieve a little lower CER by using larger networks, but only at the cost of longer training. For some insights in successful training procedures, have a look at this or that paper.

Besides from the parameters mentioned here, most parameters work for both training from scratch and warm starting. Setting parameters for network architecture does not work when starting with a pretrained model, obviously.

In everyday use I tend to set --n_augmentations=5 and train a set of 5 models using calamari-cross-fold-train.

Calamari-OCR / calamari

options used for the command train of calamari OCR #286