qqingzheng / AI-Self-Training-DPO-SDXL

Unofficial implementation of a Stable Diffusion model trained with AI Feedback-Based Self-Training Direct Preference Optimization.