Open mxjmtxrm opened 1 month ago
Hi, great work! I ran into some problems with 4-bit weight-only quantization (--lwc).
I quantized a LLaMA model with different LWC hyperparameters and got different results.