yiren-jian / BLIText-video

[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training: Video Captioning
BSD 3-Clause "New" or "Revised" License
1 stars 0 forks source link