Closed by vedantroy 1 year ago
This seems to be a bug in Triton (https://github.com/openai/triton/issues/1298). Do you have Triton installed at HEAD or 2.0.0?
I can check. Which version should I have installed?
pip list | grep triton
gives
triton 2.0.0
Hmm, I guess try HEAD to see if they've fixed it. Otherwise, please let me know what GPU you're testing on so I can try to reproduce.
Works with triton 2.1.0
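For anyone else hitting this, here is a minimal sketch of checking the installed Triton version from Python instead of `pip list | grep triton`. The helper names `parse_release` and `installed_at_least` are made up for illustration, and the 2.1.0 threshold is only taken from the report above that 2.1.0 works. It is not an officially documented minimum.

```python
from importlib.metadata import PackageNotFoundError, version


def parse_release(version_string):
    """Turn '2.1.0' or '2.1.0+cu118' into a tuple of ints such as (2, 1, 0).

    Simplified parsing (hypothetical helper): anything after the leading
    numeric components, such as 'rc1' or '.dev2023', is ignored.
    """
    parts = []
    for piece in version_string.split("+")[0].split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def installed_at_least(package, minimum):
    """Return True or False, or None if the package is not installed."""
    try:
        found = version(package)
    except PackageNotFoundError:
        return None
    return parse_release(found) >= parse_release(minimum)


if __name__ == "__main__":
    # Assumption: 2.1.0 is the first version reported to work in this thread.
    print(installed_at_least("triton", "2.1.0"))
```

The tuple comparison is deliberately crude. If pre-release or dev builds matter, `packaging.version.Version` handles those cases properly.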
I'm trying to load the 13B quantized model (which I quantized using the script in this repository). But I get the following error: