SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
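Of the low-bit schemes listed above, INT8 affine quantization is the simplest to illustrate. The sketch below is a minimal, framework-free illustration of the idea (per-tensor scale and zero point); it is not the library's implementation, and the function names are hypothetical.

```python
# Minimal sketch of affine (asymmetric) INT8 quantization.
# Hypothetical helper names; pure Python, no framework required.

def quantize_int8(values):
    """Map floats to int8 [-128, 127] using a per-tensor scale and zero point."""
    lo, hi = min(values), max(values)
    lo, hi = min(lo, 0.0), max(hi, 0.0)   # the representable range must include 0
    scale = (hi - lo) / 255 or 1.0        # guard against an all-zero tensor
    zero_point = round(-128 - lo / scale)
    q = [max(-128, min(127, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize_int8(q, scale, zero_point):
    """Recover approximate float values from the int8 representation."""
    return [(qi - zero_point) * scale for qi in q]

weights = [-1.5, -0.3, 0.0, 0.7, 2.1]
q, scale, zp = quantize_int8(weights)
recovered = dequantize_int8(q, scale, zp)
```

Round-trip error is bounded by half the scale step, which is why wider dynamic ranges (outliers) hurt low-bit accuracy and motivate the more advanced schemes the project implements.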
add online doc for 2.4, 2.5, 2.6, 3.0 #1976
NeoZhangJianyu closed 3 months ago
Type of Change
add online doc for 2.4, 2.5, 2.6, 3.0
Description
Detailed description
Expected Behavior & Potential Risk
The expected behavior triggered by this PR
How has this PR been tested?
How to reproduce the test (including hardware information)
Dependency Change?
Any library dependency introduced or removed