camenduru / text-generation-webui-colab

A colab gradio web UI for running Large Language Models
The Unlicense
2.08k stars 366 forks source link

new to *free* google colab #35

Open nbollman opened 1 year ago

nbollman commented 1 year ago

Being on an NVIDIA T4, Is it possible to utilize xformers, and use exllamav2 as the loader for (mistral flavor of your choice)GPTQ 4bit 32gs ... I have a feeling it would perform blazingly fast with minimal degradation and great context... But you've spent more time on this...