Open sz2three opened 1 year ago
Running python quant_infer.py --wbits 2 --load pyllama-7B8b.pt --text "..." --max_length 24 raises NVMLError_NoPermission: Insufficient Permissions
When I run
python quant_infer.py --wbits 2 --load pyllama-7B8b.pt --text "..." --max_length 24
it raises the following error: NVMLError_NoPermission: Insufficient Permissions