DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
BSD 3-Clause "New" or "Revised" License
2.83k stars 263 forks source link

模型错误输出结果 #169

Closed shiyeeee closed 4 months ago

shiyeeee commented 4 months ago

from modelscope import snapshot_download, AutoModelForCausalLM, AutoTokenizer,GenerationConfig model_dir = snapshot_download("damo/videollama_7b_llama2_finetuned", revision='v0.1.1',cache_dir='./model')模型是用的这个方式下载,但是上传视频或图片之后只会回答类似如下图片的内容,求解。

微信图片_20240719153955