Closed f-dangel closed 3 years ago
Run time and memory are similar for `C_in` and `C_in * N` groups. I will stick with the approach that uses `C_in` groups for now, because (i) the one-hot kernel is `N` times smaller, and (ii) the code does not require one more reshape.
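A rough sketch of the size argument in (i), not the code from this PR. It assumes the usual PyTorch convolution weight layout `(out_channels, in_channels // groups, *kernel_size)`, one output channel per input channel and kernel position, and that the `C_in * N` variant folds the batch axis into the channel axis. The function name `one_hot_kernel_numel` and the variant labels are made up for illustration.

```python
from math import prod


def one_hot_kernel_numel(variant, C_in, N, kernel_size):
    """Count the entries of the one-hot conv weight for a given group variant.

    Assumption: weight layout is (out_channels, in_channels // groups, *kernel_size),
    with out_channels = in_channels * prod(kernel_size).
    """
    K = prod(kernel_size)  # number of kernel positions
    if variant == "1":
        in_channels, groups = C_in, 1
    elif variant == "C_in":
        in_channels, groups = C_in, C_in
    elif variant == "C_in*N":
        # Assumption: batch axis is folded into channels, input viewed as (1, N * C_in, ...)
        in_channels, groups = C_in * N, C_in * N
    else:
        raise ValueError(f"unknown variant: {variant}")
    out_channels = in_channels * K
    return out_channels * (in_channels // groups) * K


sizes = {
    v: one_hot_kernel_numel(v, C_in=3, N=8, kernel_size=(2, 2))
    for v in ("1", "C_in", "C_in*N")
}
print(sizes)
```

Under these assumptions the `C_in * N` kernel has exactly `N` times as many entries as the `C_in` kernel, which matches point (i) above.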
This is an alternative approach to #12 that uses more groups, because the originally proposed optimization (using one group) degraded performance.
`1`, `C_in`, or `C_in * N` groups