Closed xjdeng closed 1 year ago
Apparently multimodal didn't like AutoGPTQ
I also experience this same exact error on a home based setup with a 3090 24GB card. Worked great before i did a update_linux two days ago. Now multimodal is broken. Worked really well before with almost all Llama based models.
Does anyone have a temporary fix to this error (maybe install an older branch)?
ATM, i wiped my one-click folder, started fresh. I am on Ubuntu. I then cloned the one-click git repo again, and then edited the webui.py command args to include "--gptq-for-llama" before running start_linux.sh. Then run yer start script and see how that goes. It finally all went back together right last night for me, now streaming speed is good again and multimodal works again as well. There is some issue with AutoGPTQ, switch back to GPTQ-for-llama for now.
The issue seems to be fixed after this commit: https://github.com/oobabooga/text-generation-webui/commit/e471919e6d504e85ac1aa58ad6bf0d46d0d9323d
Appreciate the effort but after checking out the latest commit, it seems to be working but the text generation is REALLY slow. Thought it was an issue with my colab but I tried to revert to 19f7868 and it was back to its usual speed again.
I get the same speed for minigpt4-13b with AutoGPTQ or GPTQ-for-LLaMa. It is possible that the AutoGPTQ wheel is not optimized for the Colab GPU or environment. A workaround for now is to use --gptq-for-llama
(or check gptq-for-llama
in the UI before loading the model).
That fixed the speed issue!
Describe the bug
I just started having this issue yesterday when I was running minigpt4-13b using TheBloke_Wizard-Vicuna-13B-Uncensored-GPTQ, specifically when I ask it to describe an image
Is there an existing issue for this?
Reproduction
Fire up a free Colab instance and run the following:
Then load the Gradio link, drag an image into the multimodal section, and put a prompt like "Describe the image" and hit Generate. It'll be stuck and if you go back to your Colab notebook, you'll get an error.
Screenshot
No response
Logs
System Info