Open fushh opened 2 months ago
Thanks for the great work! In stage 3, the video encoder is updated to improve its support for video-centric dialogue. Will stage 3 training affect the performance on basic video tasks? Any comparisons like Table 4 is expected.
Thanks for the great work! In stage 3, the video encoder is updated to improve its support for video-centric dialogue. Will stage 3 training affect the performance on basic video tasks? Any comparisons like Table 4 is expected.