issues
search
zhaoyl18
/
SEIKO
SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all baselines (PPO, classifier-based guidance, direct reward backpropagation) for fine-tuning Stable Diffusion.
https://arxiv.org/abs/2402.16359
MIT License
14
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Question about the implementation of bootstrap reward
#3
Guo-Stone
closed
3 weeks ago
1
Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models
#2
nmsl121381
closed
1 month ago
1
Experiment in Molecules
#1
nmsl121381
closed
3 weeks ago
1