intel / torch-xpu-ops

Apache License 2.0

[PT2.6] INT8 quantization (PT2E) feature on Linux #1003

Open riverliuintel opened 1 month ago

riverliuintel commented 1 month ago

🚀 The feature, motivation and pitch

Request INT8 quantization (PT2E) support on Linux. This requires implementing the PT2E infrastructure for the Intel GPU path, completing the essential oneDNN and Triton quantized INT8 ops, and passing quantization tests on the benchmark models.

The essential documentation changes also need to be completed.
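
For readers unfamiliar with what a quantized INT8 op computes, here is a minimal pure-Python sketch of per-tensor affine quantization. It is illustrative only: the function names (`choose_qparams`, `quantize`, `dequantize`) are hypothetical and are not part of torch-xpu-ops, oneDNN, or the PT2E API, and the signed range [-128, 127] is an assumption rather than something this issue specifies.

```python
# Illustrative per-tensor affine INT8 quantization (not the PT2E API).

def choose_qparams(values, qmin=-128, qmax=127):
    """Pick a scale and zero point so that [min, max] maps onto [qmin, qmax]."""
    # Extend the range to include 0.0 so that zero is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (qmax - qmin)
    if scale == 0.0:
        scale = 1.0  # all-zero input: any positive scale works
    zero_point = round(qmin - lo / scale)
    zero_point = max(qmin, min(qmax, zero_point))
    return scale, zero_point


def quantize(values, scale, zero_point, qmin=-128, qmax=127):
    """Map floats to integers in [qmin, qmax]: q = clamp(round(x / scale) + zp)."""
    return [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]


def dequantize(qvalues, scale, zero_point):
    """Map integers back to approximate floats: x ~ (q - zp) * scale."""
    return [(q - zero_point) * scale for q in qvalues]


if __name__ == "__main__":
    data = [-1.0, 0.0, 0.5, 2.0]
    scale, zp = choose_qparams(data)
    q = quantize(data, scale, zp)
    print("scale:", scale, "zero_point:", zp)
    print("quantized:", q)
    print("dequantized:", dequantize(q, scale, zp))
```

For inputs inside the calibrated range, the round-trip error of each value is bounded by half the scale, which is the kind of accuracy loss the benchmark-model quantization tests would need to keep acceptable.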

Alternatives

No response

Additional context

No response