AILab-CVC / SEED

Official implementation of SEED-LLaMA (ICLR 2024).
https://ailab-cvc.github.io/seed

Stage I Contrastive Learning : What is 'final' causal embedding? #23

Closed zheedong closed 8 months ago

zheedong commented 9 months ago

Hi, I have a question about how Stage I training is conducted.

In the paper, you say 'We use contrastive loss to maximize the similarity between the final causal embeddings and text features of the corresponding caption'. What do you mean by the 'final' causal embedding? Does it mean the last token of the causal embeddings (dimension [batch, 1, 768]), or the causal embeddings from the last layer (dimension [batch, 32, 768])?

geyuying commented 8 months ago

It means the last token of the causal embeddings (dimension [batch, 1, 768]).
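
For readers who want to see the shapes, here is a minimal NumPy sketch of selecting the last causal token and feeding it to a contrastive loss. It is not taken from the SEED codebase: the function names, the symmetric InfoNCE form, the temperature of 0.07, and the 768-dimensional text features are all assumptions made for illustration. Only the [batch, 32, 768] and [batch, 1, 768] shapes come from this thread.

```python
import numpy as np

def last_token(causal_embeddings):
    """Keep only the final causal token: [batch, 32, 768] -> [batch, 1, 768]."""
    return causal_embeddings[:, -1:, :]

def l2_normalize(x, eps=1e-8):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def cross_entropy_diag(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    shifted = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))

def contrastive_loss(image_feats, text_feats, temperature=0.07):
    """Symmetric InfoNCE over matched pairs (assumed form, not confirmed by the authors)."""
    img = l2_normalize(image_feats)
    txt = l2_normalize(text_feats)
    logits = img @ txt.T / temperature  # [batch, batch] similarity matrix
    return 0.5 * (cross_entropy_diag(logits) + cross_entropy_diag(logits.T))

# Random stand-ins for real model outputs (hypothetical data).
rng = np.random.default_rng(0)
causal = rng.standard_normal((4, 32, 768))  # 32 causal embeddings per image
text = rng.standard_normal((4, 768))        # one text feature per caption

final = last_token(causal)                  # shape [batch, 1, 768]
image_feats = final.squeeze(1)              # shape [batch, 768]
loss = contrastive_loss(image_feats, text)
print(final.shape, image_feats.shape, float(loss))
```

The `-1:` slice keeps the token axis so the result matches the [batch, 1, 768] shape mentioned above; the `squeeze(1)` only drops that axis before computing the similarity matrix.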