Open given131 opened 5 months ago
Hello,
I have a question regarding the embedding modification inside MultiModal2 module, right after getting the output of the CLIPModel.
MultiModal2
CLIPModel
It seems when image evidences exist, the cls token of the image embedding is removed (https://github.com/VT-NLP/Mocheg/blob/main/verification/model.py#L160)
.whereas when no text evidences given that of the text embedding is removed. (https://github.com/VT-NLP/Mocheg/blob/main/verification/model.py#L167)
My questions are,
Hello,
I have a question regarding the embedding modification inside
MultiModal2
module, right after getting the output of theCLIPModel
.It seems when image evidences exist, the cls token of the image embedding is removed (https://github.com/VT-NLP/Mocheg/blob/main/verification/model.py#L160)
.whereas when no text evidences given that of the text embedding is removed. (https://github.com/VT-NLP/Mocheg/blob/main/verification/model.py#L167)
My questions are,