note, that to get the best performance from OMP I had to set omp_threads to be twice as many as physical cores on my CPU at the begining of main.cpp.
It looks like Fastor's performance is far from maximum in this case. Am I doing something wrong?
Note, that if i reduce ths size of vector from 1e5 to 1e3, than Fastor really shines:
Dear @romeric,
I've added Fastor to test it against OpenMP SIMD for std::vector in fastor branch here in the test by 2b-t. I use Fastor this way
the results on my laptop (i7-8550U) are like this:
note, that to get the best performance from OMP I had to set omp_threads to be twice as many as physical cores on my CPU at the begining of main.cpp. It looks like Fastor's performance is far from maximum in this case. Am I doing something wrong?
Note, that if i reduce ths size of vector from 1e5 to 1e3, than Fastor really shines:
Best regards, Konstantin Ladutenko