A working pull request aimed to support SFT, DPO, RM and PPO for text-video-image to text models (specifically, Qwen2-VL)
Motivation and Context
close #75
[x] I have raised an issue to propose this change (required for new features and bug fixes)
Types of changes
What types of changes does your code introduce? Put an x in all the boxes that apply:
[ ] Bug fix (non-breaking change which fixes an issue)
[x] New feature (non-breaking change which adds core functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to change)
[x] Documentation (update in the documentation)
Checklist
Go over all the following points, and put an x in all the boxes that apply.
If you are unsure about any of these, don't hesitate to ask. We are here to help!
[x] I have read the CONTRIBUTION guide. (required)
[x] My change requires a change to the documentation.
[ ] I have updated the tests accordingly. (required for a bug fix or a new feature)
Description
A working pull request aimed to support SFT, DPO, RM and PPO for text-video-image to text models (specifically, Qwen2-VL)
Motivation and Context
close #75
Types of changes
What types of changes does your code introduce? Put an
x
in all the boxes that apply:Checklist
Go over all the following points, and put an
x
in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!