intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
https://intel.github.io/neural-compressor/
Apache License 2.0
2.23k stars 257 forks source link

update v2.6 release readme #1871

Closed chensuyue closed 5 months ago

chensuyue commented 5 months ago

Type of Change

documentation

Description

  1. release version flag update.
  2. remove conda build.
  3. update installation readme

Expected Behavior & Potential Risk

NA

How has this PR been tested?

NA

Dependency Change?

NA

github-actions[bot] commented 5 months ago

⚡ Required checks status: All passing 🟢

No groups match the files changed in this PR.


Thank you for your contribution! 💜

Note This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.