rikeilong / Bay-CAT

[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Apache License 2.0
41 stars 1 forks source link

About code and pre-trained checkpoint #5

Open xqfJohn opened 4 months ago

xqfJohn commented 4 months ago

Hi, it's really nice work! Could you make the code and pre-trained checkpoint available for public? Thanks and best wishes.