QuTech-Delft / qx-simulator

QX Simulator

Improved performance, fixed bugs #84

Closed by maxim-masterov 3 years ago

maxim-masterov commented 3 years ago

This PR fixes a few significant bugs and drastically improves the performance of the code: in the benchmark below, the measured run time drops from 45.51 s (commit 1f06044e) to 12.68 s.

Fixed bugs:

Performance improvements:

Performance was measured on an Intel(R) Xeon(R) Silver 4110 CPU @ 2.10GHz with 16 cores (multithreading off) and 93 GB RAM:

| Commit/PR | t [s] | Effective CPU utilization | Memory bound | NUMA remote access | Vectorisation: total | Vectorisation: 128-bit | Vectorisation: 256-bit | Vectorisation: 512-bit | Scalar part | Parallel region time | Serial time | DP GFLOPS | CPI rate |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1f06044e82ed742104b85e756db579eeafe75afd | 45.51 | 27.20% | 65.70% | 70.20% | 70.60% | 60.40% | 10.20% | 0.00% | 29.90% | 28.80% | 71.20% | 3.13 | 1.65 |
| #83 | 18.38 | 71.30% | 70.40% | 7.20% | 70.10% | 62.40% | 0.00% | 7.70% | 29.90% | 84.80% | 15.20% | 5.42 | 1.52 |
| This PR | 12.68 | 84.30% | 55.00% | 0.30% | 81.10% | 40.70% | 34.40% | 5.90% | 18.90% | 99.10% | 0.90% | 9.77 | 1.15 |
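
To put the run times above in relative terms, here is a minimal sketch that computes the speedup of this PR over the two earlier measurements. The timings are copied from the table; the `speedup` helper and the shortened commit label are illustrative names, not part of the qx-simulator code base.

```python
# Wall-clock times in seconds, copied from the benchmark table above.
# The dictionary keys are shortened labels chosen for this example.
timings_s = {
    "1f06044e": 45.51,
    "#83": 18.38,
    "This PR": 12.68,
}


def speedup(baseline_s: float, improved_s: float) -> float:
    """Return how many times faster the improved run is than the baseline."""
    return baseline_s / improved_s


for label in ("1f06044e", "#83"):
    factor = speedup(timings_s[label], timings_s["This PR"])
    print(f"This PR vs {label}: {factor:.2f}x faster")
```

Under these numbers the ratio is roughly 3.6x against commit 1f06044e and roughly 1.45x against #83. These are single measurements on one machine, so the factors should be read as indicative rather than exact.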