Closed markhun closed 6 months ago
Rebased on top of main (spiral access pattern) and benchmarked locally again:
Before adding bisection of search space
40 solution(s), 2270926 leafs visited
Performance counter stats for './bin/magichex 4 3 14 33 30 34 39 6 24 20' (5 runs):
7.844,51 msec task-clock:u # 1,000 CPUs utilized ( +- 1,13% )
0 context-switches:u # 0,000 /sec
0 cpu-migrations:u # 0,000 /sec
77 page-faults:u # 9,816 /sec ( +- 0,49% )
25.108.781.218 cycles:u # 3,201 GHz ( +- 0,30% ) (62,45%)
66.173.433.794 instructions:u # 2,64 insn per cycle ( +- 0,03% ) (74,96%)
14.245.497.376 branches:u # 1,816 G/sec ( +- 0,05% ) (74,98%)
125.255.986 branch-misses:u # 0,88% of all branches ( +- 0,08% ) (75,01%)
18.529.208.063 L1-dcache-loads:u # 2,362 G/sec ( +- 0,02% ) (75,04%)
3.575.200 L1-dcache-load-misses:u # 0,02% of all L1-dcache accesses ( +- 3,80% ) (75,04%)
397.838 LLC-loads:u # 50,715 K/sec ( +- 7,62% ) (49,98%)
111.003 LLC-load-misses:u # 27,90% of all L1-icache accesses ( +- 10,99% ) (49,95%)
7,8474 +- 0,0880 seconds time elapsed ( +- 1,12% )
After adding the bisection
40 solution(s), 397928 leafs visited
Performance counter stats for './bin/magichex 4 3 14 33 30 34 39 6 24 20' (5 runs):
4.992,97 msec task-clock:u # 0,999 CPUs utilized ( +- 0,48% )
0 context-switches:u # 0,000 /sec
0 cpu-migrations:u # 0,000 /sec
75 page-faults:u # 15,021 /sec ( +- 0,73% )
15.923.544.738 cycles:u # 3,189 GHz ( +- 0,35% ) (62,24%)
40.075.257.183 instructions:u # 2,52 insn per cycle ( +- 0,03% ) (74,83%)
8.600.157.232 branches:u # 1,722 G/sec ( +- 0,03% ) (75,01%)
82.663.397 branch-misses:u # 0,96% of all branches ( +- 0,05% ) (75,16%)
11.154.589.310 L1-dcache-loads:u # 2,234 G/sec ( +- 0,03% ) (75,11%)
1.833.140 L1-dcache-load-misses:u # 0,02% of all L1-dcache accesses ( +- 5,61% ) (75,06%)
292.130 LLC-loads:u # 58,508 K/sec ( +- 3,86% ) (49,88%)
88.981 LLC-load-misses:u # 30,46% of all L1-icache accesses ( +- 1,93% ) (49,77%)
4,9991 +- 0,0215 seconds time elapsed ( +- 0,43% )
Benchmarked locally.
Before:
After: