Closed weiji14 closed 9 months ago
Gonna merge this PR directly, as it consists of mostly minor tweaks (which I've accumulated over the past few weeks). Some of the options (e.g. patch_size) can be changed later, but thought it'd be good to have a datapipe/model that works with the 512x512 images soon-ish.
Modify the ViT MAE model to accept input images of size 512x512 pixel after #78. Also making a few small enhancements to the datapipe.
TODO: