OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Apache License 2.0
1.4k stars 85 forks source link

Asking for the VLN training code #50

Closed zhangyuejoslin closed 11 months ago

zhangyuejoslin commented 1 year ago

Hi Authors, thank you for sharing your code. Could you release the training code of vision and language navigation task (HAMT-based mode)? It seems only evaluation code is released now. Thanks!

wz0919 commented 11 months ago

Hi,

Sorry for the late reply! We've updated the training code to the repo.

Best