Closed: MNeMoNiCuZ closed this 1 week ago
Support model quantization with different precision levels.
Now supports "unsloth/llama-3-8b-bnb-4bit". Enable it with the setting `LOW_VRAM_MODE = True` (option to switch to a model that uses less VRAM).
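A minimal sketch of how such a toggle could work, assuming the config lives in a Python settings module; the full-precision model id (`DEFAULT_MODEL`) is an assumption for illustration, only "unsloth/llama-3-8b-bnb-4bit" comes from this PR:

```python
# Toggle between a full-precision model and a 4-bit quantized one (assumed names).
LOW_VRAM_MODE = True  # Option to switch to a model that uses less VRAM

DEFAULT_MODEL = "meta-llama/Meta-Llama-3-8B"    # hypothetical full-precision model id
LOW_VRAM_MODEL = "unsloth/llama-3-8b-bnb-4bit"  # 4-bit bitsandbytes quantized variant

# Pick the model id based on the VRAM setting
MODEL_NAME = LOW_VRAM_MODEL if LOW_VRAM_MODE else DEFAULT_MODEL
print(MODEL_NAME)  # → unsloth/llama-3-8b-bnb-4bit
```

The 4-bit variant trades some precision for a much smaller memory footprint, so it can run on GPUs where the full-precision weights would not fit.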