Cool job.
The quantification method mentioned in the article has good generality. Is there any possibility to directly support any torch model, without such a complex usage method.
The ideal method is to take a torch model, be able to use Python code to quantize the model, save it, and then use it directly.
Cool job. The quantification method mentioned in the article has good generality. Is there any possibility to directly support any torch model, without such a complex usage method. The ideal method is to take a torch model, be able to use Python code to quantize the model, save it, and then use it directly.