unable to get results when evaluating on msvd-qa benchmark

dvlab-research / LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Apache License 2.0

622 stars 39 forks source link

Closed irisgong1020 closed 23 hours ago