Open sqPoseidon opened 1 year ago
This seems like a limitation of milr::permuteLoops
, I will look into this and provide more detail
It's actually our limitation. Currently we put the reduction variable outside all the reduction loops, causing inner loops imperfect, thus we cannot directly permute those reduction loops with spatial loops.
In
packed_conv2d_nchw
function, there're four reduction loops:in_channel
,kernel_h
,kernel_w
, andbitwidth
. When I try to move the output channel loop into the reduction loops, I get the error message:Here's the example: