rikeilong / Bay-CAT

[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Apache License 2.0
41 stars, 1 fork

Code Release #2

Status: Open. XuecWu opened this issue 6 months ago

XuecWu commented 6 months ago

Great work! When will the full code be released?

Looking forward to hearing from you. Thanks a lot.