alibaba / AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
Apache License 2.0
1.98k stars 291 forks source link

how to get the pre-trained model "ViT-L-14.tar" #79

Open shuxjweb opened 1 year ago

shuxjweb commented 1 year ago

I download the pre-trained model "ViT-L-14.pt"x and its feature is 768. However, the vision_width in yaml file is set 1024. This is different.