Closed: heethesh closed this issue 5 years ago
Honestly, I never really used this network much with PReLU activations. How can I replicate your issue?
I was trying to deploy this model to C++ using this tutorial and realized that the PReLU activations are all the same instance.
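For anyone who wants to check their own model for this, here is a minimal sketch of one way to spot module objects that are reused under several attribute names. `SharedActBlock` and `find_shared_modules` are hypothetical names made up for illustration and are not part of this repository. It also assumes a PyTorch version recent enough that `named_modules` accepts `remove_duplicate`.

```python
import torch.nn as nn


class SharedActBlock(nn.Module):
    """Hypothetical block that reuses one PReLU object in two places."""

    def __init__(self, channels=4):
        super().__init__()
        activation = nn.PReLU()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.act1 = activation
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.act2 = activation  # same object as act1, so the weight is shared

    def forward(self, x):
        x = self.act1(self.conv1(x))
        return self.act2(self.conv2(x))


def find_shared_modules(model):
    """Return groups of module paths that refer to the same module object."""
    paths_by_id = {}
    # remove_duplicate=False makes named_modules yield every path,
    # including repeated references to an already-seen module.
    for name, module in model.named_modules(remove_duplicate=False):
        paths_by_id.setdefault(id(module), []).append(name)
    return [paths for paths in paths_by_id.values() if len(paths) > 1]


shared = find_shared_modules(SharedActBlock())
print(shared)
```

An empty list would mean every submodule path points at its own object.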
Fixed the problem causing the error. I still have to update the pre-trained models but in the meantime, the changes are in the param_sharing_fix branch.
Try it out and see if it works now.
@davidtvs Thanks, the tracing works fine now!
For future reference: the fix is now on the master branch.
Does doing this ensure that PReLU weights are unique for each instance of activation within the bottlenecks? When I try to trace this network with `torch.jit`, I get errors about weights shared by `nn.PReLU` layers within the submodules. Perhaps this should be implemented with `copy.deepcopy` for all instances?

To follow the original paper more closely, the number of channels can be specified for each PReLU instance to learn a weight per channel as shown here.
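A rough sketch of what the `copy.deepcopy` suggestion could look like, combined with a per-channel PReLU. The `Bottleneck` class below is a simplified, hypothetical stand-in and not the repository's actual layer. Its channel count and input size are arbitrary.

```python
import copy

import torch
import torch.nn as nn


class Bottleneck(nn.Module):
    """Hypothetical, simplified bottleneck used only for illustration."""

    def __init__(self, channels, activation):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        # deepcopy gives each position its own PReLU module and weight
        self.act1 = copy.deepcopy(activation)
        self.conv2 = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.act2 = copy.deepcopy(activation)

    def forward(self, x):
        x = self.act1(self.conv1(x))
        return self.act2(self.conv2(x))


channels = 8
# num_parameters=channels makes PReLU learn one slope per channel
block = Bottleneck(channels, nn.PReLU(num_parameters=channels))

# The two activations are distinct modules with distinct weight tensors
print(block.act1 is block.act2)
print(tuple(block.act1.weight.shape))

# Tracing with an example input of shape (batch, channels, height, width)
traced = torch.jit.trace(block, torch.randn(1, channels, 16, 16))
```

Constructing a fresh `nn.PReLU(channels)` at each position would work just as well. `deepcopy` is shown here only because it is the approach proposed above.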