kvablack / ddpo-pytorch

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
MIT License
424 stars 42 forks source link

Update README.md to include a note about the `trl` integration #16

Closed sayakpaul closed 1 year ago

sayakpaul commented 1 year ago

@kvablack FYI.