Convolution layer(s) with fewer parameters

accosmin / nano

C++ library [machine learning & numerical optimization] - superseeded by libnano

MIT License

1 stars 0 forks source link

Closed accosmin closed 8 years ago

accosmin commented 9 years ago

Reduce the number of parameters from O(#outputs \times #inputs) to O(#outputs).

Some ideas:

random weighted convolutions: O(o) = sum(i, w_o,i * (C_o @ I_i)), w_o,i being fixed
weighted convolutions: same as above, but the weights should be some normalized versions of parameters (e.g. w_o,i = x_o,i / sqrt(1 + x_o,i * x_o,i))

accosmin commented 8 years ago

Also new variation of the linear layer: use normalized weights for parameters (like described above)