⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
2.11k
stars
376
forks
source link
Hi,后面会考虑复现基于大模型如bloom、llama的rlhf流程代码? #57
Open
nieallen opened 1 year ago
如题。目前的rlhf是基于gpt2,且不是instructgpt的那套流程