huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
https://huggingface.co/transformers
Apache License 2.0
132.15k stars 26.33k forks source link

Track progress for VLMs refactoring #33374

Open zucchini-nlp opened 1 week ago

zucchini-nlp commented 1 week ago

This issue tracks the progress on improving the handling and testing of Vision-Language Models. The main goals are to enhance/enable generation tests, handle other generation techniques like assisted decoding and ensure all models pass CI checks.

I already started working on it and merged/opened some PRs. This issue should help us track how much is left until VLMs are standardized from modeling code perspective.

Motivation

,

Your contribution

.

zucchini-nlp commented 1 week ago

cc @gante 😄