HolyBayes / pytorch_ard

Pytorch implementation of Variational Dropout Sparsifies Deep Neural Networks
MIT License

Question about speeding up inference time. #9

Open atonkamanda opened 1 year ago

atonkamanda commented 1 year ago

I am confused about the inference time reduction: when I run both the MNIST baseline and the MNIST ARD model, the inference times are similar for the first 30 epochs.

I am also confused by this sentence in the README: "Model's sparsification takes almost no any speed-up effects until You convert it to the sparse one! (TODO)"

In my run the model is 99% compressed, so quite sparse, and it still achieves the same inference speed as the baseline.

Also, I am not sure what "TODO" means in that context. Does it mean the conversion to a sparse model is not implemented yet in this repo?

Thank you.

atonkamanda commented 1 year ago

After reading the code, I think a more explicit version of my question is: why should we expect any inference-time improvement when the weights with high dropout rates are only masked, not actually removed, and so still participate in the computation?
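To make the concern concrete, here is a minimal sketch (not the repo's actual code; shapes and the `log_alpha` threshold are illustrative stand-ins) of why masking alone gives no speedup: the masked weight matrix is still dense, so the matmul performs the same number of FLOPs as the unmasked layer.

```python
import torch

# Hypothetical layer shapes, for illustration only
in_features, out_features, batch = 784, 300, 64

weight = torch.randn(out_features, in_features)
log_alpha = torch.randn(out_features, in_features)  # stand-in for the learned dropout rates

# Masking: weights with high dropout rate are zeroed,
# but the tensor keeps its dense layout
mask = (log_alpha < 3.0).float()   # 3.0 is a common threshold in the variational dropout papers
masked_weight = weight * mask      # still a dense (out_features, in_features) matrix

x = torch.randn(batch, in_features)
y = x @ masked_weight.t()          # dense matmul: same cost as the unmasked layer
```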

It seems like, to gain this computational efficiency, the code would need to use something like https://pytorch.org/docs/stable/sparse.html, or am I missing something?
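As a hedged sketch of what that conversion might look like (again with made-up shapes, continuing the masked-weight example above; sparse kernel performance varies by PyTorch version and device), the one-time conversion to a sparse layout is what actually lets the cost scale with the number of nonzeros:

```python
import torch

# Hypothetical shapes; mimic a ~99%-sparse layer after thresholding
out_features, in_features, batch = 300, 784, 64
masked_weight = torch.randn(out_features, in_features)
masked_weight[torch.rand_like(masked_weight) < 0.99] = 0.0

# One-time conversion after training: dense -> sparse COO layout
sparse_weight = masked_weight.to_sparse()

x = torch.randn(batch, in_features)

# Sparse @ dense matmul: cost scales with the number of nonzero weights,
# not with out_features * in_features
y = torch.sparse.mm(sparse_weight, x.t()).t()
```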