PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback
https://align-anything.readthedocs.io
Apache License 2.0
260 stars 47 forks source link

feat: Support SFT, DPO, RM and PPO for text-video to text models #76

Closed htlou closed 2 months ago

htlou commented 2 months ago

Description

A working pull request aimed to support SFT, DPO, RM and PPO for text-video-image to text models (specifically, Qwen2-VL)

Motivation and Context

close #75

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

Checklist

Go over all the following points, and put an x in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!

htlou commented 2 months ago

Fixed all the comments and all the changes requested. Thus request another round of review.