Non-square images?

clovaai / donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

https://arxiv.org/abs/2111.15664

MIT License

5.71k stars, 462 forks

Open MohamedAliRashad opened 1 year ago

MohamedAliRashad commented 1 year ago

How can I train the model on non-square images? I tried an input size of 2200x1700 and this is the error I got:

RuntimeError: shape '[1, 55, 10, 42, 10, 1]' is invalid for input of size 233750

Wyzix33 commented 1 year ago

It seems like each input dimension needs to be a multiple of 320...
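One possible explanation for the 320 figure, offered as a sketch rather than a confirmed reading of the Donut code: if the encoder is a Swin-style backbone with patch size 4, window size 10, and three 2x downsampling steps (all three values are assumptions here, not stated in this thread), then each side must be divisible by 4 * 10 * 2^3 = 320. Under the same assumptions, the numbers in the error message can be reproduced: 2200x1700 becomes a 550x425 patch grid (233750 elements), and 425 does not split evenly into windows of 10. The helper names below are hypothetical.

```python
# Assumed values (not confirmed in this thread): Swin-style patch size 4,
# window size 10, and 3 downsampling (patch-merging) steps.
PATCH_SIZE = 4
WINDOW_SIZE = 10
NUM_DOWNSAMPLES = 3
MULTIPLE = PATCH_SIZE * WINDOW_SIZE * 2 ** NUM_DOWNSAMPLES  # 320


def explain_error(height, width):
    """Return (actual element count, element count the reshape asks for)."""
    grid_h, grid_w = height // PATCH_SIZE, width // PATCH_SIZE
    actual = grid_h * grid_w
    # The reshape splits each side into (number of windows, window size).
    requested = (
        (grid_h // WINDOW_SIZE) * WINDOW_SIZE
        * (grid_w // WINDOW_SIZE) * WINDOW_SIZE
    )
    return actual, requested


def round_up_to_multiple(value, multiple=MULTIPLE):
    """Smallest multiple of `multiple` that is >= value (integer math)."""
    return -(-value // multiple) * multiple


def is_valid_size(height, width, multiple=MULTIPLE):
    """True if both sides are divisible by `multiple`."""
    return height % multiple == 0 and width % multiple == 0


if __name__ == "__main__":
    print(explain_error(2200, 1700))
    print(round_up_to_multiple(2200), round_up_to_multiple(1700))
    print(is_valid_size(2200, 1700), is_valid_size(2240, 1920))
```

If that reading is right, a non-square size should be fine as long as both sides are multiples of 320, so 2200x1700 could be padded or resized to something like 2240x1920 before training.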