New text-to-image modality - Githubissues

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Apache License 2.0

101 stars 27 forks source link

New text-to-image modality #64

Closed Reindulger closed 1 week ago

Reindulger commented 1 week ago

Description

Added a new text-to-image modality, supporting models such as stable-diffusion and Chameleon. Benchmarks now include support for ImageRewardDB and HPSv2.
Optimize benchmarks to meet the criteria of lm-eval and lmms-eval metrics, including datasets such as MMLU, CMMLU, Belebele, GSM8K, MME, MMBench, and MM-Vet.
Fix some bugs for evaluation.

Types of changes

What types of changes does your code introduce? Put an x in all the boxes that apply:

[x] Bug fix (non-breaking change which fixes an issue)
[x] New feature (non-breaking change which adds core functionality)
[ ] Breaking change (fix or feature that would cause existing functionality to change)
[ ] Documentation (update in the documentation)

Checklist

Go over all the following points, and put an x in all the boxes that apply. If you are unsure about any of these, don't hesitate to ask. We are here to help!

[ ] I have read the [CONTRIBUTION](https://github.com/PKU-Alignment/align-anything/blob/HEAD/.github/CONTRIBUTING.md) guide. (required)
[ ] My change requires a change to the documentation.
[ ] I have updated the tests accordingly. (required for a bug fix or a new feature)
[ ] I have updated the documentation accordingly.