Open AdharshC7777 opened 4 months ago
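I would say yes, you need interpolation whenever your images do not have the same resolution (higher or lower) as the images the model was trained on. The model learns one positional embedding per patch position, so a different resolution gives a different number of patches than there are embeddings.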
Some details here: https://discuss.huggingface.co/t/fine-tuning-image-transformer-on-higher-resolution/22623/6
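To make the idea concrete, here is a rough sketch of how the positional embeddings could be resized with PyTorch. It is not taken from the linked thread. The function name `interpolate_pos_embed`, the single leading [CLS] token, the square original patch grid, and the choice of bicubic interpolation are all assumptions for illustration.

```python
import torch
import torch.nn.functional as F


def interpolate_pos_embed(pos_embed: torch.Tensor, new_grid: tuple[int, int]) -> torch.Tensor:
    """Resize ViT-style position embeddings of shape (1, 1 + N, D) to a new patch grid.

    Assumes (hypothetically) one leading [CLS] token and a square original grid.
    """
    cls_tok, patch_tok = pos_embed[:, :1], pos_embed[:, 1:]
    n, dim = patch_tok.shape[1], patch_tok.shape[2]
    old = int(round(n ** 0.5))
    if old * old != n:
        raise ValueError("expected a square patch grid")
    # (1, N, D) -> (1, D, old, old) so interpolate treats D as channels
    grid = patch_tok.reshape(1, old, old, dim).permute(0, 3, 1, 2)
    grid = F.interpolate(grid, size=new_grid, mode="bicubic", align_corners=False)
    # back to (1, new_h * new_w, D)
    patch_tok = grid.permute(0, 2, 3, 1).reshape(1, new_grid[0] * new_grid[1], dim)
    # the [CLS] embedding has no spatial position, so it is kept unchanged
    return torch.cat([cls_tok, patch_tok], dim=1)


# 224x224 input with 16x16 patches gives a 14x14 grid; 384x384 gives 24x24
old_pe = torch.randn(1, 1 + 14 * 14, 768)
new_pe = interpolate_pos_embed(old_pe, (24, 24))
print(new_pe.shape)  # torch.Size([1, 577, 768])
```

If you are using the Hugging Face `transformers` ViT models, the forward pass accepts an `interpolate_pos_encoding` argument, which as far as I know does something along these lines for you, so you may not need to write this by hand.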
Just as the title says: do we need interpolation of positional embeddings in this scenario? If yes, why?