Hi! I'm interested in your work, but there is one thing I don't quite understand. Could you tell me how many groups the other large kernels need to be divided into, and how they are divided (e.g., a spatial-wise 7×7×7 convolution is grouped into 3×3×3 splits)?