How to determine the right values for input_size?

@htcml

The smaller input_size you provide, the larger compression of your original images will be (thumbnail is created out of the original image so some information is lost during resizing).

Worth noticing is that in the config file, the input_size is [height, width], not the opposite. So it's natural that if you images are 5000 in width and 6000 in height, then [1920, 1280] outperforms [1280, 1920].

In the latter scenario, your images (5000x6000) (w x h) are firstly compressed to size of (1066x1280) (w x h), and then padded randomly to size of (1920x1280) (w x h). As you can imagine, there is lots of padded area (1M pixels) which is not used optimally.

In the former scenario, your images (5000x6000) (w x h) are firstly compressed to size of (1280x1536) (w x h), and then padded randomly to size of (1280x1920) (w x h). Less area is padded (0.5M), so the original image takes larger area in the final image provided to the model vs. the previous scenario.

clovaai / donut

How to determine the right values for input_size? #119