Closed (UdonDa closed this issue 1 year ago)
The released model has been finetuned on 512x512 images.
The input for MAE is 256x256, but the rest of the network runs at 512x512, right? Please excuse my lack of understanding.
Yes, the features are resized as described in the paper.
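For readers following along, the resizing idea can be pictured with a minimal sketch. It is not the repository's code: the helper name `resize_nearest`, the toy 2x2 grid, and the choice of nearest-neighbour interpolation are all assumptions made for illustration. The paper and the released code define the actual resizing step.

```python
def resize_nearest(feature, out_h, out_w):
    """Nearest-neighbour resize of a 2D grid given as a list of rows.

    Hypothetical helper for illustration; the real model may use a
    different interpolation mode and operates on multi-channel tensors.
    """
    in_h, in_w = len(feature), len(feature[0])
    return [
        [feature[(y * in_h) // out_h][(x * in_w) // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]


# Toy 2x2 grid standing in for features computed from the 256x256 MAE input.
small = [[1, 2],
         [3, 4]]

# Upsample by 2x in each dimension, mirroring the 256 -> 512 resolution ratio.
large = resize_nearest(small, 4, 4)

for row in large:
    print(row)
```

Each source value is repeated over a 2x2 block, so the low-resolution features line up spatially with a grid twice as large in each dimension.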
I understand your method perfectly now. Thank you for explaining it to me! I can run your code.
Thank you.
Hi, @ewrfcas
Thank you for providing your nice implementation! I have a question.
You evaluated your model at 512x512 resolution. However, you only provide 256x256 weights trained on the Places dataset. Why don't you provide 512x512 weights?