Closed: bobye closed this issue 10 years ago
A possible way to implement this feature is to create a dropout layer (as a nonlinear transform).
This may be a problem if a module has multiple instances. The dropout mask has to be recorded in apply() so that back propagation later knows which neurons were dropped. It may also be necessary to add a separate apply() for training.
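A rough sketch of how the mask could be recorded and reused, in plain Python. Everything here is an assumption for illustration: the class name `DropoutLayer`, the training-only method `apply_train()`, and `backprop()` are hypothetical and not taken from the project. The sketch uses inverted dropout (kept units are scaled by 1 / (1 - rate)), which is one common choice, not necessarily the one intended in this issue. The mask is returned to the caller rather than stored on the layer, so a module that is used in several places keeps one mask per use.

```python
import random


class DropoutLayer:
    """Hypothetical dropout layer; all names are illustrative only."""

    def __init__(self, rate, seed=None):
        if not 0.0 <= rate < 1.0:
            raise ValueError("rate must be in [0, 1)")
        self.rate = rate
        self._rng = random.Random(seed)

    def apply(self, inputs):
        # Inference path: with inverted dropout nothing is dropped or rescaled.
        return list(inputs)

    def apply_train(self, inputs):
        # Training path: draw a fresh mask and return it with the output,
        # so each use of the layer keeps its own record of dropped neurons.
        keep = 1.0 - self.rate
        mask = [1.0 / keep if self._rng.random() < keep else 0.0 for _ in inputs]
        outputs = [x * m for x, m in zip(inputs, mask)]
        return outputs, mask

    def backprop(self, grad_outputs, mask):
        # Dropped neurons receive zero gradient; kept neurons use the same scale
        # that was applied in the forward pass.
        return [g * m for g, m in zip(grad_outputs, mask)]
```

Usage would look like `out, mask = layer.apply_train(x)` in the forward pass, followed by `layer.backprop(grad_out, mask)` with the same mask in the backward pass. Storing the mask on the layer instead would be simpler, but a second forward pass through the same instance would then overwrite it.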