HarderThenHarder / transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
https://www.zhihu.com/column/c_1451236880973426688
2.11k stars 376 forks source link

Hi,后面会考虑复现基于大模型如bloom、llama的rlhf流程代码? #57

Open nieallen opened 1 year ago

nieallen commented 1 year ago

如题。目前的rlhf是基于gpt2,且不是instructgpt的那套流程