Open Dahoas opened 1 year ago
Implementation of multi-generation RL in trlX
Suggested (but optional) external inference pipeline wrapper can be found here
Depends on #529
Implementation of multi-generation RL in trlX
Suggested (but optional) external inference pipeline wrapper can be found here