gngdb closed this issue 9 years ago
OK, this issue is badly titled. It's not an explosion in parameter values, just in the number of parameters due to max pooling problems.
This can be solved by choosing the max pooling options carefully. I recommend writing the model in a notebook and printing it each time you change the pool shape and stride, so you can see how the layer sizes respond.
If you get this badly wrong, the model won't initialise because of memory errors: there are more weights than the GPU memory has space for!
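To get a feel for the numbers before building the model, here is a rough sketch of how pool shape and stride drive the size of the weight matrix between the last convolutional layer and the first fully connected layer. Everything in it is an assumption for illustration: the function names, the 44x44 feature map, the 64 channels and the 1024 fully connected units are hypothetical, and it assumes square inputs, no padding and a "floor" rounding convention. The library's own output shapes may differ by one depending on how it handles partial windows, so the printed model remains the authority.

```python
def pooled_size(size, pool, stride):
    """Output length along one axis after pooling.

    Assumes no padding and that partial windows are dropped (floor convention).
    """
    return (size - pool) // stride + 1


def fc_weight_count(conv_out, channels, pool, stride, fc_units):
    """Number of weights joining a pooled conv layer to a fully connected layer.

    conv_out is the side length of the (square) feature map before pooling.
    """
    side = pooled_size(conv_out, pool, stride)
    return side * side * channels * fc_units


if __name__ == "__main__":
    # Hypothetical layer sizes, not taken from the actual model.
    conv_out, channels, fc_units = 44, 64, 1024
    for pool, stride in [(2, 1), (2, 2), (4, 4)]:
        n = fc_weight_count(conv_out, channels, pool, stride, fc_units)
        # float32 weights take 4 bytes each
        print("pool=%d stride=%d weights=%d (~%.0f MiB)"
              % (pool, stride, n, n * 4 / 2.0 ** 20))
```

With a stride of 1 the pooled map is barely smaller than its input, so the weight count stays close to the unpooled case. Matching the stride to the pool shape cuts it roughly by the square of the stride.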
Looking at the results from the Analysing Network notebook, we're getting a massive number of parameters in the weight matrix connecting the convolutional layers to the fully connected layers. This is probably something we should avoid. Will look into reducing it with max pooling. Notes will be in the iterative design notebook.