Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

zhaoyl18 / SEIKO

SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.

https://arxiv.org/abs/2402.16359

MIT License

14 stars 0 forks source link

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models #2

Closed nmsl121381 closed 1 month ago

nmsl121381 commented 2 months ago

Hi, Zhao Yulai, I noticed that in another article Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models, there seems to be no mention of the relevant code base. Can you also share the code of this article? Thank you very much.

zhaoyl18 commented 1 month ago

Thanks for being attentive to our work. As that paper is under review, we will release the code later.