pytorch / tvm

TVM integration into PyTorch
449 stars 64 forks source link

Quantized linear opt #120

Closed kimishpatel closed 4 years ago

kimishpatel commented 4 years ago

Tensorize quantized linear. Fixed a bug in the compute. Depends on this PR. https://github.com/facebookexperimental/tvm/pull/7

Specifically this PR does:

Benchmark comparision is against fbgemm implementation. Without the rounding we are 10% faster with rounding we are on par. Need to figure out why are we losing perf due to rounding.

kimishpatel commented 4 years ago

@yinghai, updated.