allenai / RL4LMs

A modular RL library to fine-tune language models to human preferences
https://rl4lms.apps.allenai.org/
Apache License 2.0
2.21k stars 191 forks source link

Upgrade to torch 2.0 #69

Open agastyaseth opened 9 months ago

agastyaseth commented 9 months ago

are there any plans to port the library to torch 2? Since the parallelize() library is deprecated in torch 2, it becomes impossible to train larger models like llama 7b and mistral 7b even with A100 80GBs

zhangguoxin1 commented 7 months ago

i have the same problem.