Open powenlo opened 5 years ago
I am still confused about why the shared MLP can be achieved by 1D convolution in the code. It should be FC layers, right? Can anyone explain this to me? (better to give me an example) Thanks a lot.
l have the same,question.
I think convolution can do same things as MLP ,and convolution is easier to train.
I am still confused about why the shared MLP can be achieved by 1D convolution in the code. It should be FC layers, right? Can anyone explain this to me? (better to give me an example) Thanks a lot.