issues
search
bzluan
/
TextCoT
The official repo for “TextCoT: Zoom In for Enhanced Multimodal Text-Rich Image Understanding”.
32
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
prepare_stage2_question.py cannot be found.
#5
hanzefang
opened
1 month ago
1
Results on the TextVQA benchmark
#4
Gavin001201
opened
2 months ago
1
No answer_format.json file!!!
#3
orormaybe
opened
2 months ago
3
installation and demo
#2
GallonDeng
opened
6 months ago
0
llava 1.5 textvqa baseline 为何如此低?
#1
hhaAndroid
closed
6 months ago
2