SmilingWolf / JAX-CV

Repo for my JAX CV experiments. Mostly revolving around the Danbooru20xx dataset

Question about the image size #13

Closed by pink-red 3 months ago

pink-red commented 3 months ago

Hello! Following #12, I noticed that you corrected the image size from 448 to 512. That got me curious: why is the size different from the one in the configs of the models on Hugging Face?

By the way, thanks for creating this amazing project!

SmilingWolf commented 3 months ago

During training, the images are randomly cropped (only a little bit, I don't want to risk cutting away too much) and resized using randomly chosen algorithms as part of the augmentation pipeline. This makes the models more resilient to shifting, zooming and the particular choice of resizing software during inference.

Saving the images slightly bigger than the target size makes it so that the cropping and resizing augmentations don't end up being no-ops.
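
To make the idea concrete, here is a minimal JAX sketch of that kind of augmentation. It is not the repo's actual pipeline: the function name `random_crop_and_resize`, the 0.9 minimum crop fraction, and the list of resize methods are all assumptions chosen for illustration. Only the 512 stored size and the 448 target size come from this thread.

```python
import jax
import jax.numpy as jnp

STORED_SIZE = 512         # size the images are saved at (from this thread)
TARGET_SIZE = 448         # size the model configs use (from this thread)
MIN_CROP_FRACTION = 0.9   # assumption: crop "only a little bit"
# Assumption: a few of the interpolation methods jax.image.resize supports.
RESIZE_METHODS = ("nearest", "linear", "cubic", "lanczos3")


def random_crop_and_resize(key, image, target_size=TARGET_SIZE):
    """Hypothetical helper: random square crop, then resize with a random method.

    Runs eagerly (not under jit) because the crop size is data dependent.
    """
    k_size, k_top, k_left, k_method = jax.random.split(key, 4)
    height, width, channels = image.shape
    short_side = min(height, width)

    # Pick a crop side between 90% and 100% of the shorter image side.
    min_crop = int(short_side * MIN_CROP_FRACTION)
    crop = int(jax.random.randint(k_size, (), min_crop, short_side + 1))

    # Pick a random top-left corner so the crop stays inside the image.
    top = int(jax.random.randint(k_top, (), 0, height - crop + 1))
    left = int(jax.random.randint(k_left, (), 0, width - crop + 1))
    cropped = image[top:top + crop, left:left + crop, :]

    # Pick a random interpolation method for the final resize.
    idx = int(jax.random.randint(k_method, (), 0, len(RESIZE_METHODS)))
    method = RESIZE_METHODS[idx]
    resized = jax.image.resize(
        cropped, (target_size, target_size, channels), method=method
    )
    return resized, (top, left, crop), method


key = jax.random.PRNGKey(0)
image = jnp.zeros((STORED_SIZE, STORED_SIZE, 3), dtype=jnp.float32)
out, crop_box, method = random_crop_and_resize(key, image)
print(out.shape, crop_box, method)
```

With a 512 pixel source, the crop side lands somewhere between 460 and 512, so the final resize to 448 always resamples the image. If the source were already 448, a crop of at most 448 pixels followed by a resize back to 448 would only ever upsample a smaller window or leave the image untouched, which leaves the augmentation little room to work.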

pink-red commented 3 months ago

Ah, I see, thanks!