changes and setup - Githubissues

jackaduma / Vicuna-LoRA-RLHF-PyTorch

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna

MIT License

208 stars 18 forks source link

changes and setup #17

Open Ekanshjain55 opened 6 months ago