X-PLUG / mPLUG-2

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
Apache License 2.0
220 stars 18 forks source link

Where can I find the universal layer module? #14

Closed Ta-Gu closed 11 months ago

Ta-Gu commented 11 months ago

I checked the file model_video_caption_mplug.py but I find there is no universal layer module in the code, but I see that images are first fed into visual encoders and then fed into text encoders. Does it mean universal layer is actually text encoder?

Ta-Gu commented 11 months ago

I am glad that I found it in the text_encoder and there is a variable named uni_layer.