Maybe don't need this rearrange

lucidrains / CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

MIT License

1.04k stars 88 forks source link

Closed CiaoHe closed 2 years ago

CiaoHe commented 2 years ago

lucidrains commented 2 years ago

@CiaoHe Hi again :wave: :smile: turns out I had that notated incorrectly https://github.com/lucidrains/CoCa-pytorch/blob/main/coca_pytorch/coca_pytorch.py#L461 sorry for the confusion!

CiaoHe commented 2 years ago

@lucidrains Oh, I misunderstand the usage of CE. Ha, yes, the input logits should be 'b, vocab_size, length'