jackaduma / Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
MIT License

Does it really work on RTX2080Ti? #1

Open GuofaHuang opened 1 year ago

GuofaHuang commented 1 year ago

What optimizations have been made to make it work on a 12G RTX2080Ti? Is the precision reduced to 2-bit?

jackaduma commented 1 year ago

What optimizations have been made to make it work on a 12G RTX2080Ti? Is the precision reduced to 2-bit?

Please read the code or the README. The LoRA config and training parameters are all public.
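
For context on the precision question, here is a rough back-of-the-envelope sketch of how much memory the model weights alone would need at different bit widths. It is not taken from the repository: the 7B parameter count is an assumption, the 12 GiB budget is the figure quoted in the question, and the helper `weight_memory_gib` is a hypothetical name used only for illustration. Activations, LoRA adapter weights, gradients, and optimizer state are ignored, so real usage will be higher.

```python
def weight_memory_gib(num_params: int, bits_per_param: int) -> float:
    """Memory needed to hold the model weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


NUM_PARAMS = 7_000_000_000  # assumption: a 7B-parameter Vicuna variant
GPU_MEMORY_GIB = 12         # figure quoted in the question above

for bits in (16, 8, 4, 2):
    gib = weight_memory_gib(NUM_PARAMS, bits)
    verdict = "fits" if gib < GPU_MEMORY_GIB else "does not fit"
    print(f"{bits:>2}-bit weights: {gib:5.2f} GiB ({verdict} in {GPU_MEMORY_GIB} GiB)")
```

Under these assumptions, 16-bit weights alone exceed the card's memory while 8-bit weights leave headroom, so going as low as 2-bit would not be strictly necessary just to load the model. Which precision the repository actually uses is stated in its code and README.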