issues
search
CarperAI
/
DRLX
Diffusion Reinforcement Learning Library
MIT License
171
stars
7
forks
source link
DPO
#30
Open
shahbuland
opened
8 months ago
shahbuland
commented
8 months ago
Add DPO LoRA support
Should add non-lora support as well but this is WIP for now