Closed biaoxiaoduan closed 5 years ago
I noticed the NHWC implementation of convolution, and it's extremely fast on AVX2. But it seems Neon version is not implemented, I didn't find source file named SimdNeonConvolution.cpp. Do you have any plan for that?
Yes. I have such plans. Now I'm waiting for ARM dev board (it is arriving 1-2 weeks) to start NEON optimizations.
That's great! I am looking foward to that.
I noticed the NHWC implementation of convolution, and it's extremely fast on AVX2. But it seems Neon version is not implemented, I didn't find source file named SimdNeonConvolution.cpp. Do you have any plan for that?