zhaoyl18 SEIKO issues - Githubissues

zhaoyl18 / SEIKO

SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.

https://arxiv.org/abs/2402.16359

MIT License

14 stars 0 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Question about the implementation of bootstrap reward

#3 Guo-Stone closed 3 weeks ago
1
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

#2 nmsl121381 closed 1 month ago
1
Experiment in Molecules

#1 nmsl121381 closed 3 weeks ago
1

zhaoyl18 / SEIKO

issues

Question about the implementation of bootstrap reward

Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

Experiment in Molecules