While trying out your amazing idea, I noticed a bug when I initialized some parameters, but do not actually used them in training. The error AttributeError: 'NoneType' object has no attribute 'data' occurs because p.grad is None for these unused parameters. This happens since they are not involved in the training process.
I know it is a small contribution, so feel free to reject this pull request if it does not fit your requirements.
While trying out your amazing idea, I noticed a bug when I initialized some parameters, but do not actually used them in training. The error AttributeError: 'NoneType' object has no attribute 'data' occurs because p.grad is None for these unused parameters. This happens since they are not involved in the training process.
I know it is a small contribution, so feel free to reject this pull request if it does not fit your requirements.