new results or previous results could not find all raise issues in CI model test

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

https://intel.github.io/neural-compressor/

Apache License 2.0

2.23k stars 257 forks source link

Closed chensuyue closed 3 months ago

chensuyue commented 3 months ago

validation update

New results or previous results could not find all raise issues in CI model test, so the test will raise accuracy regression on time.

the expected behavior that triggered by this PR

how to reproduce the test (including hardware information)

any library dependency introduced or removed