Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
7.71k
stars
669
forks
source link
Is there any documentation to train this on my own data ? #59
Open
gauravgandhi1315 opened 9 months ago