Closed willsamu closed 10 months ago
Adds support for AWQ quantisation by adding an additional build-arg QUANTIZATION.
QUANTIZATION
Updated vLLM core dependency in order to support new feature.
Tested to work with TheBloke/airoboros-l2-7B-3.0-AWQ and TheBloke/Airoboros-L2-70B-3.1.2-AWQ.
TheBloke/airoboros-l2-7B-3.0-AWQ
TheBloke/Airoboros-L2-70B-3.1.2-AWQ
Great work @willsamu, thank you for your contribution!
Commit: 4f792062aaea02c526ee906979925b447811ef48
Adds support for AWQ quantisation by adding an additional build-arg
QUANTIZATION
.Updated vLLM core dependency in order to support new feature.
Tested to work with
TheBloke/airoboros-l2-7B-3.0-AWQ
andTheBloke/Airoboros-L2-70B-3.1.2-AWQ
.