muzairkhattak / multimodal-prompt-learning

[CVPR 2023] Official repository of the paper "MaPLe: Multi-modal Prompt Learning".
https://muzairkhattak.github.io/multimodal-prompt-learning/
MIT License
619 stars, 43 forks

the dimension of image #25

Closed: SHIBOYA closed this issue 1 year ago

SHIBOYA commented 1 year ago

The input image to MaPLe has shape (3, 224, 224). Are the 3 channels in RGB order, or BGR?

muzairkhattak commented 1 year ago

Hi @SHIBOYA,

The input image to MaPLe has shape (3, 224, 224), with the channels in RGB order.

Thank you.
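
For readers whose image loader returns BGR data in (H, W, 3) layout (OpenCV's `cv2.imread` does this), here is a minimal sketch of getting to the channel order and layout described above. It uses only NumPy. The function name `bgr_hwc_to_rgb_chw` and the dummy image are illustrative assumptions, not part of the MaPLe codebase, and resizing to 224x224 and any normalization are deliberately left out.

```python
import numpy as np


def bgr_hwc_to_rgb_chw(image_bgr: np.ndarray) -> np.ndarray:
    """Convert an (H, W, 3) BGR image into a (3, H, W) RGB array.

    Hypothetical helper for illustration only; not taken from the MaPLe repo.
    """
    if image_bgr.ndim != 3 or image_bgr.shape[2] != 3:
        raise ValueError(f"expected shape (H, W, 3), got {image_bgr.shape}")
    # Reverse the last axis to swap the channel order: BGR -> RGB.
    image_rgb = image_bgr[:, :, ::-1]
    # Move channels first: (H, W, 3) -> (3, H, W), and make the result contiguous.
    return np.ascontiguousarray(image_rgb.transpose(2, 0, 1))


# Dummy 224x224 image that is pure blue when interpreted as BGR.
dummy_bgr = np.zeros((224, 224, 3), dtype=np.uint8)
dummy_bgr[..., 0] = 255  # channel 0 is blue in BGR order

chw_rgb = bgr_hwc_to_rgb_chw(dummy_bgr)
print(chw_rgb.shape)  # (3, 224, 224)
```

After the conversion, index 0 along the first axis holds red and index 2 holds blue, so the blue values from the dummy image end up in `chw_rgb[2]`. Images loaded with PIL and converted via `.convert("RGB")` are already in RGB order and only need the layout change.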