issues
search
rikeilong
/
Bay-CAT
[ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios
Apache License 2.0
41
stars
1
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Release model/dataset on HF
#7
NielsRogge
opened
2 months ago
0
About code and model checkpoint
#6
kaiw7
opened
4 months ago
0
About code and pre-trained checkpoint
#5
xqfJohn
opened
4 months ago
0
About table2 comparison on Music-AVQA dataset
#4
Cece1031
opened
4 months ago
2
About the dataset processing
#3
Yuzuriha-Inori-x
opened
5 months ago
2
Code Release
#2
XuecWu
opened
6 months ago
0
About cue aggregator
#1
yeppp27
opened
7 months ago
4