issues
search
HumanSignal
/
RLHF
Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI models
154
stars
30
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
REWARD_CHECKPOINT_PATH ( How I solve following issue?)
#7
nadimkaysar
opened
1 month ago
0
How to fix the following errors?
#6
missflash
opened
9 months ago
1
Create LICENSE
#5
seele1917
opened
11 months ago
0
Update README.md
#4
erinmikailstaples
closed
1 year ago
0
Added images and details for PPO training
#3
JimmyWhitaker
closed
1 year ago
2
Update Readme structure
#2
niklub
closed
1 year ago
0
Example/rlhf nb
#1
JimmyWhitaker
closed
1 year ago
0