PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback
https://align-anything.readthedocs.io
Apache License 2.0

feat: support diffusion dpo and sft #12

Closed Gaiejj closed 4 months ago

Gaiejj commented 4 months ago

Description

This PR adds support for supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) of diffusion models.
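For readers unfamiliar with how DPO carries over to diffusion models, here is a minimal sketch of the commonly used Diffusion-DPO objective. It is not the code from this PR: the function names, the list-of-floats inputs, and the default `beta` are hypothetical and chosen only for illustration. The idea is to compare the noise-prediction error of the trained model with that of a frozen reference model on the preferred ("w") and rejected ("l") samples. Plain SFT corresponds to minimizing the noise-prediction error alone.

```python
import math


def mse(a, b):
    """Mean squared error between two equal-length sequences of floats."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def log_sigmoid(z):
    """Numerically stable log(sigmoid(z))."""
    return min(z, 0.0) - math.log1p(math.exp(-abs(z)))


def diffusion_dpo_loss(noise_w, noise_l, pred_w, pred_l, ref_w, ref_l, beta=1.0):
    """Illustrative Diffusion-DPO loss for one preference pair.

    noise_* : true noise added to the preferred (w) / rejected (l) sample
    pred_*  : noise predicted by the model being trained
    ref_*   : noise predicted by the frozen reference model
    beta    : preference strength (illustrative default, an assumption)
    """
    # How much better the trained model denoises the winner than the loser.
    model_diff = mse(pred_w, noise_w) - mse(pred_l, noise_l)
    # The same quantity for the reference model.
    ref_diff = mse(ref_w, noise_w) - mse(ref_l, noise_l)
    # The loss shrinks as the trained model improves on the winner
    # relative to the reference.
    return -log_sigmoid(-0.5 * beta * (model_diff - ref_diff))
```

When the trained model matches the reference exactly, the loss equals log 2. It drops below that value once the trained model fits the preferred sample better than the reference does.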

An example output from a diffusion model trained with DPO, generated from the following prompt:

A photo of beautiful mountain with realistic sunset and blue lake, highly detailed, masterpiece

[Image: image_4]

XuyaoWang commented 4 months ago

LGTM.