Closed KaifAhmad1 closed 5 months ago
Hi @KaifAhmad1, idefics2 can be loaded using AutoModelForVision2Seq
Hi @amyeroberts, any tips for optimizing IDEFICS-2 on Tesla T4? Tried unsloth library but it doesn't support Multimodal LLMs. Any other ideas?
I've only used the model on A10G, so don't know about Tesla T4. This is a question best placed in our forums. We try to reserve the github issues for feature requests and bug reports.
I'd suggest opening an feature request to support more modalities in unsloth - they do great work and it would surely be very impactful for all users!
I don't know too much about the ins-and-outs of what unsloth does. Some of the techniques for finetuning models e.g. LoRA are also available through Hugging Face's peft library.
If just for inference, there's an accelerate guide on using large models here and quantization in transformers here
Thanks! @amyeroberts
System Info
Cuda : 12.1 OS : Windows x64 pip : 24.0 python : 3.10.10 transformers : 4.40.0 bitsandbytes: 0.43.1
Who can help?
Hey, there @younesbelkada , @amyeroberts I am getting this exception when Quantizing IDEFICS-2 for custom fine tuning!
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Expected behavior
It will run without raising any error!