Closed dnouri closed 1 year ago
Previously generation would fail with:
File "/alpaca_lora_4bit/text-generation-webui/matmul_utils_4bit.py", line 79, in _matmul4bit_v1_recons quant_cuda.vecquant4recons_v1(qweight, buffer, scales, zeros) RuntimeError: expected scalar type Half but found Float
See #71
Previously generation would fail with:
See #71