Purpose and description of q-extract-torch

Let me tell you the result of our meeting.

New quantize techniques are sometimes implemented using PyTorch. Therefore, q-implant aims to output quantized circle model by inputting circle model and json & npy files containing quantization parameters of the quantized PyTorch model.

However, converting quantized pytorch model to onnx model is not officially supported. Therefore, starting q-implant from onnx rather than pytorch is difficult to satisfy q-implant's design objectives.

q-extract is a module designed to obtain circle model and json & npy files corresponding to the input of q-implant. Therefore, we judged that implementing q-extract for quantized onnx model rather than quantized pytorch model was also not in line with the existing purpose.

The figure above is a diagram of q-extract-torch.

I will explain the inputs and outputs of the q-extract-torch module in text. Input 1: original PyTorch model (not quantized) Input 2: quantized PyTorch model (quantized) Output 1: Circle model (not quantized, to be used as input for q-implant) Output 2: json & npy files (to be used as input for q-implant)

In addition, I will explain the functionality of the q-extract-torch module. First, Convert original PyTorch model (Input 1) to create original Circle model (Output 1). Second, Two input PyTorch models (Input 1 & 2) store mapping information for models before and after quantization. Based on this mapping information, output json & npy files (Output 2) corresponding to the input of q-implant.

ONEforALL-S003 / ONE

Purpose and description of q-extract-torch #14