CONV variant parallelized on output channel dimension

pulp-platform / pulp-nn

Apache License 2.0

Issue #2

Closed: FrancescoConti closed this issue 3 years ago

FrancescoConti commented 4 years ago

In some corner cases, it could be more advantageous to parallelize convolutions over the Kout dimension instead of the Hout/Wout remainder (as in depthwise convolutions). Potential advantages:

- better parallelism when Hout is small
- same reuse pattern

Potential disadvantages:

- could trigger more TCDM contention than Hout/Wout parallelization, as cores share the same input
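
To make the "small Hout" case concrete, here is a minimal sketch of how the work could be split per core under either scheme. It assumes an 8-core cluster and a simple ceil-divide chunking; `chunk_range` and `busy_cores` are hypothetical helpers for illustration only, not PULP-NN functions, and the real kernels may partition differently.

```c
#include <assert.h>

#define NUM_CORES 8 /* assumption: 8-core cluster */

/* Half-open index range [start, stop) assigned to one core. */
typedef struct {
    int start;
    int stop;
} range_t;

static int min_int(int a, int b) { return a < b ? a : b; }

/* Hypothetical helper: contiguous chunk of [0, dim) owned by core_id.
 * The chunk size is ceil(dim / num_cores); trailing cores may get an
 * empty range when dim is smaller than num_cores. */
static range_t chunk_range(int dim, int core_id, int num_cores)
{
    assert(num_cores > 0);
    int chunk = (dim + num_cores - 1) / num_cores;
    range_t r;
    r.start = min_int(chunk * core_id, dim);
    r.stop = min_int(r.start + chunk, dim);
    return r;
}

/* Hypothetical helper: number of cores that receive a non-empty range
 * when a dimension of size dim is split across num_cores. */
static int busy_cores(int dim, int num_cores)
{
    int busy = 0;
    for (int core = 0; core < num_cores; core++) {
        range_t r = chunk_range(dim, core, num_cores);
        if (r.stop > r.start) {
            busy++;
        }
    }
    return busy;
}
```

With illustrative sizes Hout = 4 and Kout = 64, `busy_cores(4, NUM_CORES)` gives 4 (half the cluster idles when splitting on Hout), while `busy_cores(64, NUM_CORES)` gives 8, with each core owning 8 output channels. The sketch says nothing about TCDM contention, which would have to be measured on the target.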

@NBruschi @anga93 what do you think?

NBruschi commented 4 years ago

Added sources and headers for the standard convolution and matmul kernels, named pulp_nn_conv_Co_parallel and pulp_nn_matmul_Co_parallel respectively, to the 8-bit library.

FrancescoConti commented 4 years ago

Super-fast! I'll keep the issue open till we test it a bit in DORY...