Closed by naitoh 6 years ago
This is a very interesting result.
Without knowing the exact compile flags used for both NumPy and NArray, it doesn't make sense to change anything in the NArray source code. Differences in optimisation or vectorisation can easily lead to runtime differences like this.
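For the Ruby side, a minimal sketch of how the flags could be checked is below. It assumes the extension was built through mkmf with the defaults taken from `RbConfig`, which a gem's own `extconf.rb` or the build environment may override. The helper name `build_flags` is made up for this example and is not part of Numo::NArray.

```ruby
require "rbconfig"

# Hypothetical helper: collect the compiler settings Ruby itself was built with.
# C extensions built via mkmf usually start from these values, but a gem's
# extconf.rb or environment variables can change them, so treat the output
# as a starting point rather than proof of how NArray was compiled.
def build_flags
  keys = %w[CC CFLAGS optflags debugflags]
  keys.map { |key| [key, RbConfig::CONFIG.fetch(key, "")] }.to_h
end

build_flags.each { |key, value| puts "#{key}: #{value}" }
```

Comparing this output with the build information NumPy reports for the same machine would show whether the two libraries were compiled at comparable optimisation levels.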
I do not know the reason. My guess is that the C compiler failed to generate optimized code, and that NumPy has special treatment for this case. I have not found a way to make it faster.
Thank you for your reply.
I have created a patch that improves broadcasting performance and sent it as pull request #94.
Best regards.
As for the two questions, the problem has been fixed. Thank you very much.
After comparing Numo::NArray's broadcasting speed with NumPy's, I have two questions.
Environment
Benchmarked code
Result
Best regards.