PenghuiCheng opened this issue 3 months ago
@EikanWang should evaluate the right way to enable quantization for the XPU backend. Even though CUDA has this dispatch key, and our philosophy is to align with CUDA, QuantizedXPU is used by PyTorch's legacy quantization solution, which we will not follow up on. A Tensor with the QuantizedXPU dispatch key carries the quantization information inside the Tensor itself. Other quantization solutions do not need such a Tensor representation: the scale and shift live in separate Tensors, which the operator API or the graph introduces. So let me lower the priority of this issue.
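For illustration, here is a minimal sketch of the separate-tensor representation described above. This is not PyTorch's actual API; the `quantize`/`dequantize` helpers and their parameters are hypothetical:

```python
import torch

# Hypothetical sketch: quantized values, scale, and shift (zero point) are
# plain Tensors carried alongside each other. No quantized dtype and no
# QuantizedXPU dispatch key are involved, so this representation works on
# any backend.
def quantize(x: torch.Tensor, scale: torch.Tensor, shift: torch.Tensor) -> torch.Tensor:
    # Map float values into the int8 range; scale/shift stay outside the tensor.
    return torch.clamp(torch.round(x / scale + shift), -128, 127).to(torch.int8)

def dequantize(q: torch.Tensor, scale: torch.Tensor, shift: torch.Tensor) -> torch.Tensor:
    return (q.to(torch.float32) - shift) * scale

x = torch.randn(4, 4)          # device="xpu" in an XPU-enabled build
scale = torch.tensor(0.1)
shift = torch.tensor(0.0)
q = quantize(x, scale, shift)  # ordinary int8 Tensor, no special dispatch key
y = dequantize(q, scale, shift)
```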
🚀 The feature, motivation and pitch
Quantization support is needed for the XPU backend. Quantized operations on XPU currently fail with: NotImplementedError: Could not run 'aten::_empty_affine_quantized' with arguments from the 'QuantizedXPU' backend.
Failing cases:
- test_view_ops_xpu.py::TestOldViewOpsXPU::test_flatten_xpu
- test_view_ops_xpu.py::TestOldViewOpsXPU::test_ravel_xpu
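A minimal repro sketch, assuming a PyTorch build with XPU support. The failing tests may reach aten::_empty_affine_quantized through a different call path, but constructing an affine-quantized tensor directly on the xpu device exercises the same missing kernel (shapes and dtype below are illustrative):

```python
import torch

# Allocating an affine-quantized tensor on 'xpu' dispatches to the
# QuantizedXPU backend; no kernel is registered there for
# aten::_empty_affine_quantized, which produces the NotImplementedError above.
q = torch._empty_affine_quantized(
    (4, 4), scale=0.1, zero_point=0, dtype=torch.qint8, device="xpu"
)
```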
Alternatives
No response
Additional context
No response