microsoft / CvT

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
MIT License
533 stars 120 forks source link

what should I change if I want to use a data set with images of 750* 184 #11

Open Oussamab21 opened 2 years ago

Oussamab21 commented 2 years ago

Hi, thanks for sharing the code, I am using a dataset that can be converted to images of size 750* 184, I was wondering what should I change in this code ? thanks in advance

sauravgarg540 commented 2 years ago

Hi @Oussamab21, The first layer of the model is a convolution layer hence you don't need to change the model for processing different image dimensions. You need to create a YAML file with your desired image dimension.