Dantong88 / LLARVA

Apache License 2.0

poor inference performance with the provided finetuned checkpoints #6

Closed: hengyuan-zhang-0 closed this issue 1 hour ago

hengyuan-zhang-0 commented 2 hours ago

[image]

I merged the LoRA checkpoints provided here and followed the inference guide, but the results I obtained are not ideal, as shown in the image below. @Dantong88 Can you provide some help?

[image]

hengyuan-zhang-0 commented 2 hours ago

question_id: 2, 3, 4. Corresponding images: [image] [image] [image]

Dantong88 commented 1 hour ago

The fine-tuned checkpoint might perform poorly on some specific subsets because it is co-trained on a very diverse action-vision VQA set. You can fine-tune it further to adapt it to your target dataset.