Open satabios opened 3 years ago
From my experience, size of the first convolutional filter affects the performance. 5x5x5 filter has 125 elements whereas 7x7x7 almost 3 times more. I experimented with modified version of MinkowskiResnet architecture and when using 1x1x1, 3x3x3 or 5x5x5 first conv layers the performance was similar. For 7x7x7 it was 50% slower and for 9x9x9 over 100% slower (compared to 5x5x5)
It has been mentioned in the paper that
or the first layer, instead of a 7×7 2D convolution, we use a 5×5×5×1 generalized sparse convolution. However, for the rest of the networks, we follow the original network architecture.
why was a 5x5x5x1 convolution chosen?I'm trying to experiment with ResneXt on MinknowskiEngine, any guidance? I just need to introduce groups in the make_layer functions but is there any efficient way of doing so?