Repo for BenTsao (本草) [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge.
Apache License 2.0
How much GPU memory does inference require? With 40 GB of GPU memory, running infer.sh fails with: CUDA error: out of memory #69
python3.8 infer.py \
    --base_model './llama-7b-hf' \
    --lora_weights './lora-llama-med' \
    --use_lora True \
    --instruct_dir './data/infer.json' \
    --prompt_template 'med_template'
The error is raised at the "using lora" step.
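For context on the memory question, here is a rough back-of-the-envelope sketch of how much memory the base model weights alone would take at different precisions. It assumes a round figure of 7e9 parameters for the 7B model (the exact count differs slightly), and the helper name `weight_memory_gib` is made up for illustration. It does not account for activations, the generation cache, the LoRA weights, or CUDA overhead, and it says nothing about which precision infer.py actually loads the model in.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Return the memory needed for the model weights alone, in GiB."""
    return num_params * bytes_per_param / 1024**3


# Assumption: a round 7e9 parameters for a "7B" model.
NUM_PARAMS = 7e9

# Bytes per parameter for common storage precisions.
estimates = {
    "float32": weight_memory_gib(NUM_PARAMS, 4),
    "float16": weight_memory_gib(NUM_PARAMS, 2),
    "int8": weight_memory_gib(NUM_PARAMS, 1),
}

for dtype, gib in estimates.items():
    print(f"{dtype}: ~{gib:.1f} GiB for weights only")
```

Under these assumptions, even full-precision weights come in below 40 GB, so it may be worth checking whether another process is already holding memory on the same GPU, or whether the model is being loaded more than once.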