espressif / esp-dl

Espressif deep-learning library for AIoT applications
MIT License

quantization parameters and convert model step by step (AIV-393) #53

Closed PureHing closed 2 years ago

PureHing commented 2 years ago

@yehangyang Hi, I got the ONNX model according to the code, named mnist_model_pytorch1.onnx (softmax removed):

[screenshot: ONNX model graph]

Then I got mnist_calib.pickle by executing quantization_tool/examples/example.py:

mnist_calib.pickle:

```
>>> f = open("mnist_calib.pickle", 'rb')
>>> a = pickle.load(f)
>>> a
{'9': 16.0, '11': 8.0, 'output': 4.0, 'input': 64.0, '8': 16.0, '10': 8.0, '7': 64.0, 'fc1.weight': array([256.]), 'fc1.bias': 16.0, 'fc2.weight': array([256.]), 'fc2.bias': 8.0, 'fc3.weight': array([128.]), 'fc3.bias': 4.0}
```

Do the values (16.0, 8.0, ...) in the dictionary represent the output_exponent values in the config.json file?

BTW, for my own model, are the following steps correct?

1. Prepare a float32 model and convert it to an ONNX model.
2. Execute quantization_tool/examples/example.py.
3. Get the output_exponent (can the current tools generate the exponent value?) and write a config.json file.
4. Execute convert_tool/convert.py.

Thanks!

Auroragan commented 2 years ago

Hi,

The values in the dictionary stand for scales for now, which means int_value = scale * fp_value. The output_exponent value in the config.json file should be -log2(scale); the tool will export the exponent value directly in the next version.
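As a minimal sketch, that conversion can be done directly on the pickle above (assuming, as in your dump, that every scale is an exact power of two):

```python
import math
import pickle

# Load the calibration table produced by example.py.
with open("mnist_calib.pickle", "rb") as f:
    scales = pickle.load(f)

# output_exponent = -log2(scale); weight entries are 1-element arrays.
exponents = {}
for name, scale in scales.items():
    value = float(scale[0]) if hasattr(scale, "__len__") else float(scale)
    exponents[name] = -int(math.log2(value))

print(exponents)
# e.g. 'input': 64.0 -> -6, 'output': 4.0 -> -2, 'fc1.weight': array([256.]) -> -8
```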

For your own model, the steps are mostly correct:

1. Prepare a float32 model and convert it to an ONNX model.

2. You can basically follow the code in example.py; some modification is needed (see the sketch after this list):

```
model_path = 'mnist_model_example.onnx'

calib_dataset = test_images[0:5000:50]
```

3/4. Get the out_exponent from -log2(scale), then do step 3 and step 4 as you said. Steps 3/4 will also be supported in the quantization tool for convenience, but it is still in testing; you can experiment with it by calling export_coefficient_to_cpp(model, pickle_file_path, target_chip, output_path, name). A new version will be released soon.
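As a rough sketch of step 2, the calibration part of example.py looks like the following (the Calibrator constructor arguments and method names are taken from the example script of that era and may differ in other versions; the input file names are placeholders):

```python
import onnx
import numpy as np
from calibrator import Calibrator  # shipped with the quantization tool

# Load your own float32 ONNX model in place of the MNIST example.
model_proto = onnx.load('mnist_model_example.onnx')

# Calibration data: a representative subset of preprocessed inputs.
# In example.py this is a slice of the MNIST test set; substitute your own.
calib_dataset = np.load('calib_inputs.npy')  # placeholder file name

# Generate the quantization table (the pickle file with per-tensor scales).
calib = Calibrator('int16', 'per-tensor', 'minmax')
calib.set_providers(['CPUExecutionProvider'])
calib.generate_quantization_table(model_proto, calib_dataset, 'my_calib.pickle')
```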

If there is any question or suggestion, please feel free to let us know

PureHing commented 2 years ago

@Auroragan Hi, in which dynamic library can I locate this function (export_coefficient_to_cpp), and when will the next version be released?

Auroragan commented 2 years ago

Hi, @PureHing

Please check the latest master branch.

The function is in calibrator; you can refer to the code in example.py:

```
calib.export_coefficient_to_cpp(model_proto, pickle_file_path, 'esp32s3', '.', 'test_mnist', True)
```
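For reference, here is the same call with each argument annotated (the meanings are inferred from the example and this thread, not from an authoritative signature):

```python
calib.export_coefficient_to_cpp(
    model_proto,        # loaded ONNX ModelProto
    pickle_file_path,   # calibration table from generate_quantization_table()
    'esp32s3',          # target chip
    '.',                # output directory
    'test_mnist',       # base name of the generated .cpp/.hpp pair
    True,               # presumably whether to print per-layer quantization info
)
```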

PureHing commented 2 years ago

@Auroragan Exporting finished; the output files are ./test_mnist.cpp and ./test_mnist.hpp. Are the .npy files necessary for convert.py?

Auroragan commented 2 years ago

The purpose of convert.py is to convert coefficients stored in .npy files to .cpp and .hpp, which is the same as the export_coefficient_to_cpp function.

If you can use export_coefficient_to_cpp to convert, you don't need to use convert.py anymore.
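For completeness, if you do go the convert.py route, it reads the per-layer coefficient .npy files together with a config.json carrying the exponents. A minimal sketch of writing such a config from the calibration pickle follows; the layer key and field layout here are illustrative assumptions, so check the convert tool's config.json specification for the real schema:

```python
import json
import math
import pickle

# Per-tensor scales from calibration.
with open("mnist_calib.pickle", "rb") as f:
    scales = pickle.load(f)

# Hypothetical layout: one entry per layer with its output_exponent,
# computed as -log2(scale). The bias scale is used here as a stand-in
# for the layer's output scale; verify against the real specification.
config = {
    "fc1": {"output_exponent": -int(math.log2(float(scales["fc1.bias"])))},
}
with open("config.json", "w") as f:
    json.dump(config, f, indent=4)
```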

PureHing commented 2 years ago

Thanks very much!