RLHF training code for StableVicuna open sourced?

Stability-AI / StableLM

StableLM: Stability AI Language Models

Apache License 2.0

15.84k stars 1.04k forks source link

RLHF training code for StableVicuna open sourced? #69

Open REIGN12 opened 1 year ago

REIGN12 commented 1 year ago

Very exciting to see you guys' remarkable work on stablevicuna!! And I read through your blog and notice that all the dataset is open sourced and available; however, considering the training code part, the only mentioned details are that you are using trlx for training. So will there be any more detailed recipe or code for the RL tuning phase? Many thanks in advance and really appreciate your effort!!

LouisCastricato commented 1 year ago

WIP.