THUDM / ImageReward

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Apache License 2.0

Implementation of ReFL StableDiffusionXL with LoRA #94

Closed: alirezadizaji closed this 1 month ago

alirezadizaji commented 1 month ago

Hi, this PR implements ReFL training with LoRA on SDXL, which enables faster fine-tuning with a larger batch size.
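For readers unfamiliar with why LoRA helps here, the following is a minimal, dependency-free sketch of the general LoRA idea, not the code from this PR. It assumes a single linear layer with a frozen weight `W` and a trainable low-rank update `B @ A`. The function names and the 1280-wide layer are hypothetical and chosen only for illustration.

```python
# Illustrative sketch of LoRA on one linear layer (not the PR's implementation).
# Assumption: the frozen weight W has shape (d_out, d_in); only A (rank, d_in)
# and B (d_out, rank) are trained.

def full_params(d_in, d_out):
    """Trainable parameters when the whole weight matrix is fine-tuned."""
    return d_in * d_out


def lora_params(d_in, d_out, rank):
    """Trainable parameters when only the low-rank factors A and B are trained."""
    return rank * (d_in + d_out)


def matvec(m, v):
    """Multiply a matrix (list of rows) by a vector (list of numbers)."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def lora_forward(W, A, B, x, alpha, rank):
    """Compute y = W x + (alpha / rank) * B (A x)."""
    base = matvec(W, x)            # frozen path
    low = matvec(B, matvec(A, x))  # trainable low-rank path
    scale = alpha / rank
    return [b + scale * l for b, l in zip(base, low)]


if __name__ == "__main__":
    # Hypothetical 1280 x 1280 projection with rank 4.
    print(full_params(1280, 1280), lora_params(1280, 1280, 4))
```

Because only `A` and `B` receive gradients, optimizer state and gradient memory shrink accordingly, which is the usual reason a LoRA run can fit a larger batch size than full fine-tuning.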

xujz18 commented 1 month ago

Cool! Thanks a lot!