DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
BSD 3-Clause "New" or "Revised" License
2.77k stars 255 forks source link

Finetune with LoRA and QLoRA #162

Open thisurawz1 opened 5 months ago

thisurawz1 commented 5 months ago

can you tell me how to use LoRA or QLoRA to finetune this model? moreover how to load the entire model from huggingface?