intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
https://intel.github.io/neural-compressor/
Apache License 2.0
2.23k stars 257 forks source link

Skip some tests for torch 2.4 #1981

Closed yiliu30 closed 3 months ago

yiliu30 commented 3 months ago

Type of Change

bug fix API changed or not: None

Description

Some pt2e-related tests requires torch 2.5