Open akashAD98 opened 2 years ago
should i need to complete all 300 epoch training to get a quantized model? bcz I'm getting an error when i try to export 150 epoch model, & when i resume this training, it adds extra epochs. here you can see i tried it for 300 epochs but when i stopped training & started resuming training its showed 389 epochs
getting this issue while converting to onnx, whats wrong here? should i need to do continuous training without stop?
even training is not completed ,only 2 epochs remaining ,its giving cuda out of memory ,
Hi @akashAD98
The quantization only happens at the last 2 epochs of the training. This is specified in the recipe file. So if you halt the training before the quantization epoch, you will not get a quantized model.
The screenshot below shows where you can change the quantization epoch.
① num_epochs
is the total number of training epochs.
②quantization_start_epoch
is the exact epoch where quantization begins.
doing resume=True in train.py solved the problem