lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
MIT License
7.67k stars 668 forks

How to fine-tune and train on my own data? #20

Open: rbhatia46 opened this issue 1 year ago

rbhatia46 commented 1 year ago

Hi, are there any references for training and fine-tuning this on my own data?