Closed by ThomasBaruzier 8 months ago
It looks like the variants list is empty? The script is written to test multiple versions of the same model. You'd point it to /mnt/models/my_model and it would scan for /mnt/models/my_model/3.0bpw, /mnt/models/my_model/4.0bpw, etc. For a quick fix you should be able to just do:
model_base = "/mnt/models"
variants = ["my_model"]
Or something like that.
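For anyone hitting the same problem, here is a rough sketch of the kind of base-directory-plus-variants lookup described above. It is not the script's actual code: the helper names `resolve_variant_dirs` and `discover_variants` are made up for illustration, and the assumption is simply that each variant is a subdirectory of `model_base`.

```python
from pathlib import Path


def resolve_variant_dirs(model_base, variants):
    """Hypothetical helper: join each variant name onto model_base and
    keep only the entries that exist as directories."""
    base = Path(model_base)
    return [base / name for name in variants if (base / name).is_dir()]


def discover_variants(model_base):
    """Hypothetical helper: list the subdirectory names of model_base,
    e.g. to fill in the variants list when it would otherwise be empty."""
    base = Path(model_base)
    return sorted(p.name for p in base.iterdir() if p.is_dir())
```

With this layout, an empty variants list resolves to no directories at all, so nothing is tested. Setting the base to /mnt/models and the variants to ["my_model"] treats the single model directory as the only variant, which is what the quick fix above does.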
Yes, that was the issue. Thank you very much!
Example:
I haven't tested with other benchmarks. Perplexity evaluation is working with test_inference.py, though.
Environment: Arch Linux, RTX 3090, CUDA 12.1.105, Exllama V2 commit 7d37b50d90908f899a45ef8ee901a4105c19fbe3
Is there anything I can do to get this script to work? Thanks.