Closed ivllm closed 3 years ago
The network slimming loss is incorporated in the code, see this function https://github.com/Eric-mingjie/rethinking-network-pruning/blob/master/cifar/network-slimming/main.py#L122. We update the scaling factor explicitly.
I understand. Thank you very much for your quick reply!
Hi, Thank you for sharing a good experiment. I have a question about the loss function of network slimming.
The paper shows the training objective as shown below.
But codes only use cross entropy function when training after pruning. Is this the right implementation? Please explain if I misunderstood.