Extractor in vit_pytorch will detach the tensor.

lucidrains / CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

MIT License

1.04k stars, 88 forks

Closed (techkang closed this issue 2 years ago)

techkang commented 2 years ago

Thanks for your code! I think I may have found a small bug. The cloned tensor in Extractor is detached (https://github.com/lucidrains/vit-pytorch/blob/main/vit_pytorch/extractor.py#L39), so gradients may not propagate back to the image encoder.
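The effect described above can be reproduced in isolation. The following is a minimal sketch, assuming the captured output is stored as something like `output.clone().detach()`; the `encoder` here is a hypothetical stand-in (a single `nn.Linear`), not the actual ViT or Extractor code, and `encoder_grad_flows` is an illustrative helper name.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical stand-in for the image encoder (assumption, not the real ViT).
encoder = nn.Linear(4, 3)
x = torch.randn(2, 4)


def encoder_grad_flows(detach: bool) -> bool:
    """Return True if a loss on the captured tensor yields encoder gradients."""
    encoder.zero_grad(set_to_none=True)
    latents = encoder(x)

    # Mimic capturing an intermediate output, with or without detaching it.
    captured = latents.clone()
    if detach:
        captured = captured.detach()

    loss = captured.sum()
    if not loss.requires_grad:
        # The tensor is cut off from the autograd graph; backward() would raise.
        return False

    loss.backward()
    return encoder.weight.grad is not None


detached_flows = encoder_grad_flows(detach=True)
attached_flows = encoder_grad_flows(detach=False)
print("detached:", detached_flows, "| not detached:", attached_flows)
```

`clone()` on its own stays in the autograd graph, so only the `detach()` call cuts the encoder off from the loss. If the encoder is meant to be trained through the captured embeddings, the detach step would need to be skipped or made optional.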

lucidrains commented 2 years ago

techkang commented 2 years ago

The bug is fixed. Thank you.