fanghuaqi closed this issue 5 years ago
The pre-trained model is used only in "optimal" mode, where DDPG uses it to find the best pruning ratios. Training itself does not use the pre-trained model. If you are using "uniform" or another mode, the pre-trained model is never used at all.
@yuanyuanli85 Thanks for your answer, but I am wondering why training doesn't use the pre-trained model. Shouldn't it be easier to fine-tune from a pre-trained model than to train from scratch?
Will this project consider using a pre-trained model as the starting point for pruning in the future?
@fanghuaqi It is certainly possible to use a pre-trained model for warm-start instead of training from scratch. We may add support for this in the near future. You are also very welcome to submit a PR implementing this feature.
@jiaxiang-wu If we want to implement this feature, what process should we follow? Is there already warm-start code in PocketFlow, such as stub code that developers can fill in, or do we need to build it from scratch? I think this feature might be needed for all the learners, since it should reduce training time compared with training from scratch.
Thanks, Huaqi
@fanghuaqi You can first try implementing the warm-start feature for the weight sparsification learner only. That may be an easier place to start.
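As a rough illustration of the warm-start idea discussed above (not PocketFlow's actual code or API), the sketch below copies pre-trained values into a model's variables wherever the name and size match, leaves the rest at their from-scratch initialisation, and then builds a simple magnitude-based pruning mask. The function names `warm_start` and `magnitude_mask`, the variable names, and the use of flat Python lists in place of tensors are all assumptions made for this example. Magnitude-based masking is used here as a generic stand-in and may differ from what the weight sparsification learner does.

```python
import random


def warm_start(model_vars, pretrained_vars):
    """Copy pre-trained values into model_vars where name and size match.

    Both arguments map a variable name to a flat list of floats (a stand-in
    for real tensors). Returns the names that were restored and skipped.
    """
    restored, skipped = [], []
    for name, value in model_vars.items():
        source = pretrained_vars.get(name)
        if source is not None and len(source) == len(value):
            model_vars[name] = list(source)  # copy so later updates do not alias
            restored.append(name)
        else:
            skipped.append(name)  # keeps its from-scratch initialisation
    return restored, skipped


def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the `sparsity` fraction of smallest |w|."""
    n_prune = int(len(weights) * sparsity)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = set(order[:n_prune])
    return [0 if i in pruned else 1 for i in range(len(weights))]


# Hypothetical variable names and values, for illustration only.
rng = random.Random(0)
model_vars = {
    "conv1/kernel": [rng.gauss(0.0, 0.1) for _ in range(4)],
    "fc/kernel": [rng.gauss(0.0, 0.1) for _ in range(6)],
}
pretrained_vars = {
    "conv1/kernel": [0.5, -0.05, 0.3, -0.8],
    "fc/kernel": [0.1, 0.2, 0.3],  # size mismatch, e.g. a different class count
}

restored, skipped = warm_start(model_vars, pretrained_vars)
mask = magnitude_mask(model_vars["conv1/kernel"], sparsity=0.5)
print("restored:", restored)
print("skipped:", skipped)
print("mask:", mask)
```

Skipping variables whose size does not match, rather than raising an error, lets a checkpoint trained for a different output layer still warm-start the shared layers. Whether that is the right behaviour for a given learner is a design choice.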
Hi, I have a question about pruning: does the pruning implemented in PocketFlow use pre-trained weights as the training base, rather than training from scratch?
Is that correct?
I looked at the documentation but didn't find any information about this.
Thanks, Huaqi