Closed insanesac closed 3 months ago
Are you sure the .safetensors file isn't corrupt?
I thought that was the case too. So switched branches - 8.0bpw, 6.0bpw and 4.0bpw. The same error. Let me download the files individually and then see if that solved the issu
Looks like it was corrupted. Downloaded the files individually fixed it. Thanks
python3 test_inference.py -m /app/Llama2-7B-chat-exl2 -p "Once upon a time"
I get MemoryError as shown in the screenshot. The same happens when I run chat.py. I tried logging to know which file was causing the error. output.safetensors is the cause.
I have an Ubuntu 22.04 machine with nvidia T4 GPU. The machine also has 64GB RAM.