muzairkhattak / multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
https://muzairkhattak.github.io/multimodal-prompt-learning/
MIT License
578 stars 43 forks source link

A feasible solution to use Maple with CNN-based VLMs #57

Closed AnonymousUserxx closed 3 months ago

AnonymousUserxx commented 3 months ago

Hi~

Thanks for sharing your codes! We found the Maple proposed in your paper may not be directly applicable to some VLMs using CNN-based encoders, such as ResNets. I just come up with a feasible solution to use Maple with CNN-based VLMs. The diagram is shown as below.

MaPlE_CNN_Encoder