lipracer / cuda-rt-hook

BSD 3-Clause "New" or "Revised" License
20 stars 10 forks source link

Speedup the compilation #162

Open dynamicheart opened 1 week ago

dynamicheart commented 1 week ago

发版耗时由15min增加至40min

image image
lipracer commented 1 week ago

clang profiling: ===-------------------------------------------------------------------------=== ... Pass execution timing report ... ===-------------------------------------------------------------------------=== Total Execution Time: 105.3474 seconds (105.3627 wall clock)

---User Time--- --System Time-- --User+System-- ---Wall Time--- --- Name --- 16.8948 ( 16.8%) 1.4964 ( 32.4%) 18.3912 ( 17.5%) 18.4083 ( 17.5%) X86 DAG->DAG Instruction Selection 7.1176 ( 7.1%) 0.2295 ( 5.0%) 7.3471 ( 7.0%) 7.3494 ( 7.0%) Function Integration/Inlining 5.9367 ( 5.9%) 0.0461 ( 1.0%) 5.9829 ( 5.7%) 5.9839 ( 5.7%) SROA #2 3.5782 ( 3.6%) 0.1017 ( 2.2%) 3.6799 ( 3.5%) 3.6807 ( 3.5%) Global Value Numbering 2.2980 ( 2.3%) 0.0495 ( 1.1%) 2.3475 ( 2.2%) 2.3480 ( 2.2%) Combine redundant instructions #2 1.6550 ( 1.6%) 0.0485 ( 1.1%) 1.7035 ( 1.6%) 1.7034 ( 1.6%) ReachingDefAnalysis 1.6289 ( 1.6%) 0.0452 ( 1.0%) 1.6742 ( 1.6%) 1.6748 ( 1.6%) Live Variable Analysis 1.5011 ( 1.5%) 0.0000 ( 0.0%) 1.5011 ( 1.4%) 1.5019 ( 1.4%) Global Variable Optimizer #2 1.3112 ( 1.3%) 0.0377 ( 0.8%) 1.3489 ( 1.3%) 1.3492 ( 1.3%) CodeGen Prepare 1.1704 ( 1.2%) 0.0488 ( 1.1%) 1.2192 ( 1.2%) 1.2192 ( 1.2%) Combine redundant instructions #3 1.1714 ( 1.2%) 0.0372 ( 0.8%) 1.2087 ( 1.1%) 1.2086 ( 1.1%) Combine redundant instructions #5 1.1503 ( 1.1%) 0.0581 ( 1.3%) 1.2084 ( 1.1%) 1.2081 ( 1.1%) Combine redundant instructions #6 1.1614 ( 1.2%) 0.0407 ( 0.9%) 1.2022 ( 1.1%) 1.2017 ( 1.1%) Jump Threading 1.1559 ( 1.1%) 0.0281 ( 0.6%) 1.1840 ( 1.1%) 1.1840 ( 1.1%) Combine redundant instructions #4 1.1416 ( 1.1%) 0.0198 ( 0.4%) 1.1613 ( 1.1%) 1.1610 ( 1.1%) Memory SSA 0.9539 ( 0.9%) 0.1707 ( 3.7%) 1.1245 ( 1.1%) 1.1248 ( 1.1%) SROA 1.0717 ( 1.1%) 0.0179 ( 0.4%) 1.0896 ( 1.0%) 1.0897 ( 1.0%) Early CSE w/ MemorySSA 1.0618 ( 1.1%) 0.0245 ( 0.5%) 1.0863 ( 1.0%) 1.0861 ( 1.0%) Combine redundant instructions #7 1.0537 ( 1.0%) 0.0266 ( 0.6%) 1.0802 ( 1.0%) 1.0803 ( 1.0%) Value Propagation #2 1.0403 ( 1.0%) 0.0185 ( 0.4%) 1.0588 ( 1.0%) 1.0587 ( 1.0%) Combine redundant instructions #8 0.9544 ( 0.9%) 0.0514 ( 1.1%) 1.0058 ( 1.0%) 1.0067 ( 1.0%) Combine redundant instructions 0.9721 ( 1.0%) 0.0210 ( 0.5%) 0.9931 ( 0.9%) 0.9931 ( 0.9%) Branch Probability Basic Block Placement 0.9511 ( 0.9%) 0.0320 ( 0.7%) 0.9831 ( 0.9%) 0.9839 ( 0.9%) Called Value Propagation 0.9243 ( 0.9%) 0.0384 ( 0.8%) 0.9628 ( 0.9%) 0.9624 ( 0.9%) Memory SSA #2 0.9347 ( 0.9%) 0.0171 ( 0.4%) 0.9518 ( 0.9%) 0.9517 ( 0.9%) Memory SSA #4 0.9298 ( 0.9%) 0.0138 ( 0.3%) 0.9436 ( 0.9%) 0.9439 ( 0.9%) Machine Instruction Scheduler 0.9155 ( 0.9%) 0.0220 ( 0.5%) 0.9375 ( 0.9%) 0.9377 ( 0.9%) Dead Store Elimination 0.9167 ( 0.9%) 0.0151 ( 0.3%) 0.9318 ( 0.9%) 0.9324 ( 0.9%) Global Variable Optimizer 0.8977 ( 0.9%) 0.0297 ( 0.6%) 0.9274 ( 0.9%) 0.9272 ( 0.9%) Induction Variable Simplification 0.8691 ( 0.9%) 0.0565 ( 1.2%) 0.9256 ( 0.9%) 0.9262 ( 0.9%) Interprocedural Sparse Conditional Constant Propagation 0.8962 ( 0.9%) 0.0252 ( 0.5%) 0.9215 ( 0.9%) 0.9211 ( 0.9%) Memory SSA #3 0.8675 ( 0.9%) 0.0191 ( 0.4%) 0.8867 ( 0.8%) 0.8867 ( 0.8%) Value Propagation 0.8154 ( 0.8%) 0.0699 ( 1.5%) 0.8852 ( 0.8%) 0.8852 ( 0.8%) Merge disjoint stack slots 0.8149 ( 0.8%) 0.0447 ( 1.0%) 0.8596 ( 0.8%) 0.8602 ( 0.8%) X86 Assembly Printer 0.8131 ( 0.8%) 0.0169 ( 0.4%) 0.8300 ( 0.8%) 0.8304 ( 0.8%) Greedy Register Allocator 0.8144 ( 0.8%) 0.0156 ( 0.3%) 0.8300 ( 0.8%) 0.8297 ( 0.8%) Unroll loops #2 0.7200 ( 0.7%) 0.0260 ( 0.6%) 0.7460 ( 0.7%) 0.7460 ( 0.7%) Machine Common Subexpression Elimination 0.7046 ( 0.7%) 0.0158 ( 0.3%) 0.7204 ( 0.7%) 0.7202 ( 0.7%) Induction Variable Users 0.6722 ( 0.7%) 0.0147 ( 0.3%) 0.6869 ( 0.7%) 0.6865 ( 0.7%) Loop Strength Reduction 0.6553 ( 0.7%) 0.0178 ( 0.4%) 0.6731 ( 0.6%) 0.6731 ( 0.6%) SLP Vectorizer 0.6338 ( 0.6%) 0.0123 ( 0.3%) 0.6460 ( 0.6%) 0.6460 ( 0.6%) Live Interval Analysis 0.6206 ( 0.6%) 0.0183 ( 0.4%) 0.6389 ( 0.6%) 0.6391 ( 0.6%) X86 FP Stackifier 0.6018 ( 0.6%) 0.0331 ( 0.7%) 0.6348 ( 0.6%) 0.6349 ( 0.6%) Loop Load Elimination 0.5488 ( 0.5%) 0.0180 ( 0.4%) 0.5667 ( 0.5%) 0.5668 ( 0.5%) X86 Byte/Word Instruction Fixup 0.5202 ( 0.5%) 0.0268 ( 0.6%) 0.5470 ( 0.5%) 0.5472 ( 0.5%) Branch Probability Analysis #2 0.4398 ( 0.4%) 0.0958 ( 2.1%) 0.5356 ( 0.5%) 0.5377 ( 0.5%) Simplify the CFG 0.4648 ( 0.5%) 0.0252 ( 0.5%) 0.4900 ( 0.5%) 0.4899 ( 0.5%) Branch Probability Analysis #6 0.4629 ( 0.5%) 0.0133 ( 0.3%) 0.4762 ( 0.5%) 0.4763 ( 0.5%) Machine Copy Propagation Pass 0.4442 ( 0.4%) 0.0144 ( 0.3%) 0.4586 ( 0.4%) 0.4585 ( 0.4%) Control Flow Optimizer 0.4277 ( 0.4%) 0.0209 ( 0.5%) 0.4486 ( 0.4%) 0.4486 ( 0.4%) Branch Probability Analysis #4 0.4156 ( 0.4%) 0.0282 ( 0.6%) 0.4438 ( 0.4%) 0.4438 ( 0.4%) Branch Probability Analysis #3 0.4114 ( 0.4%) 0.0170 ( 0.4%) 0.4284 ( 0.4%) 0.4282 ( 0.4%) Branch Probability Analysis #5 0.4039 ( 0.4%) 0.0129 ( 0.3%) 0.4168 ( 0.4%) 0.4166 ( 0.4%) Sparse Conditional Constant Propagation 0.3871 ( 0.4%) 0.0219 ( 0.5%) 0.4090 ( 0.4%) 0.4090 ( 0.4%) Simplify the CFG #5 0.3793 ( 0.4%) 0.0182 ( 0.4%) 0.3975 ( 0.4%) 0.3974 ( 0.4%) Simplify the CFG #6 0.3617 ( 0.4%) 0.0209 ( 0.5%) 0.3826 ( 0.4%) 0.3824 ( 0.4%) Aggressive Dead Code Elimination 0.3739 ( 0.4%) 0.0048 ( 0.1%) 0.3787 ( 0.4%) 0.3788 ( 0.4%) Simple Register Coalescing 0.3576 ( 0.4%) 0.0142 ( 0.3%) 0.3718 ( 0.4%) 0.3717 ( 0.4%) Simplify the CFG #7 0.3536 ( 0.4%) 0.0095 ( 0.2%) 0.3631 ( 0.3%) 0.3631 ( 0.3%) Machine Copy Propagation Pass #2 0.3462 ( 0.3%) 0.0152 ( 0.3%) 0.3614 ( 0.3%) 0.3614 ( 0.3%) Simplify the CFG #2 0.3405 ( 0.3%) 0.0113 ( 0.2%) 0.3519 ( 0.3%) 0.3516 ( 0.3%) MachineDominator Tree Construction #9 0.3320 ( 0.3%) 0.0106 ( 0.2%) 0.3426 ( 0.3%) 0.3425 ( 0.3%) Dominator Tree Construction #7 0.3208 ( 0.3%) 0.0213 ( 0.5%) 0.3421 ( 0.3%) 0.3421 ( 0.3%) Simplify the CFG #3 0.3033 ( 0.3%) 0.0088 ( 0.2%) 0.3121 ( 0.3%) 0.3119 ( 0.3%) Bit-Tracking Dead Code Elimination 0.2953 ( 0.3%) 0.0163 ( 0.4%) 0.3116 ( 0.3%) 0.3114 ( 0.3%) Post-Dominator Tree Construction 0.2897 ( 0.3%) 0.0115 ( 0.2%) 0.3012 ( 0.3%) 0.3011 ( 0.3%) Simplify the CFG #8 0.2915 ( 0.3%) 0.0078 ( 0.2%) 0.2993 ( 0.3%) 0.2991 ( 0.3%) Eliminate PHI nodes for register allocation 0.2799 ( 0.3%) 0.0147 ( 0.3%) 0.2946 ( 0.3%) 0.2945 ( 0.3%) Dominator Tree Construction #21 0.2780 ( 0.3%) 0.0134 ( 0.3%) 0.2914 ( 0.3%) 0.2913 ( 0.3%) Dominator Tree Construction #12 0.2797 ( 0.3%) 0.0096 ( 0.2%) 0.2894 ( 0.3%) 0.2894 ( 0.3%) Prologue/Epilogue Insertion & Frame Finalization 0.2702 ( 0.3%) 0.0146 ( 0.3%) 0.2848 ( 0.3%) 0.2847 ( 0.3%) Reassociate expressions 0.2566 ( 0.3%) 0.0117 ( 0.3%) 0.2682 ( 0.3%) 0.2681 ( 0.3%) Jump Threading #2 0.2560 ( 0.3%) 0.0123 ( 0.3%) 0.2682 ( 0.3%) 0.2681 ( 0.3%) Simplify the CFG #4 0.2556 ( 0.3%) 0.0062 ( 0.1%) 0.2618 ( 0.2%) 0.2617 ( 0.2%) Machine code sinking 0.2515 ( 0.2%) 0.0056 ( 0.1%) 0.2570 ( 0.2%) 0.2570 ( 0.2%) Branch Probability Analysis 0.2457 ( 0.2%) 0.0096 ( 0.2%) 0.2553 ( 0.2%) 0.2554 ( 0.2%) Remove redundant instructions 0.2338 ( 0.2%) 0.0059 ( 0.1%) 0.2397 ( 0.2%) 0.2397 ( 0.2%) Virtual Register Rewriter 0.2005 ( 0.2%) 0.0343 ( 0.7%) 0.2349 ( 0.2%) 0.2358 ( 0.2%) Early CSE 0.2223 ( 0.2%) 0.0088 ( 0.2%) 0.2311 ( 0.2%) 0.2310 ( 0.2%) Dominator Tree Construction #15 0.2239 ( 0.2%) 0.0061 ( 0.1%) 0.2300 ( 0.2%) 0.2299 ( 0.2%) Peephole Optimizations 0.2189 ( 0.2%) 0.0073 ( 0.2%) 0.2263 ( 0.2%) 0.2261 ( 0.2%) MachinePostDominator Tree Construction #2 0.2142 ( 0.2%) 0.0112 ( 0.2%) 0.2254 ( 0.2%) 0.2254 ( 0.2%) Dominator Tree Construction #6 0.2012 ( 0.2%) 0.0215 ( 0.5%) 0.2226 ( 0.2%) 0.2225 ( 0.2%) MachineDominator Tree Construction 0.2104 ( 0.2%) 0.0106 ( 0.2%) 0.2210 ( 0.2%) 0.2209 ( 0.2%) Dominator Tree Construction #11 0.2109 ( 0.2%) 0.0100 ( 0.2%) 0.2209 ( 0.2%) 0.2208 ( 0.2%) Dominator Tree Construction #9 0.2086 ( 0.2%) 0.0088 ( 0.2%) 0.2174 ( 0.2%) 0.2173 ( 0.2%) Dominator Tree Construction #8 0.2102 ( 0.2%) 0.0059 ( 0.1%) 0.2161 ( 0.2%) 0.2160 ( 0.2%) MachinePostDominator Tree Construction 0.2094 ( 0.2%) 0.0066 ( 0.1%) 0.2160 ( 0.2%) 0.2159 ( 0.2%) Live Range Shrink 0.2015 ( 0.2%) 0.0128 ( 0.3%) 0.2143 ( 0.2%) 0.2140 ( 0.2%) Dominator Tree Construction #10 0.2054 ( 0.2%) 0.0073 ( 0.2%) 0.2127 ( 0.2%) 0.2126 ( 0.2%) X86 Execution Dependency Fix 0.1951 ( 0.2%) 0.0137 ( 0.3%) 0.2088 ( 0.2%) 0.2086 ( 0.2%) Dominator Tree Construction #16 0.2017 ( 0.2%) 0.0045 ( 0.1%) 0.2062 ( 0.2%) 0.2060 ( 0.2%) Dominator Tree Construction #17 0.1900 ( 0.2%) 0.0144 ( 0.3%) 0.2044 ( 0.2%) 0.2047 ( 0.2%) Dominator Tree Construction #3 0.1830 ( 0.2%) 0.0174 ( 0.4%) 0.2005 ( 0.2%) 0.2003 ( 0.2%) Dominator Tree Construction #2 0.1895 ( 0.2%) 0.0060 ( 0.1%) 0.1955 ( 0.2%) 0.1955 ( 0.2%) Constant Hoisting 0.1886 ( 0.2%) 0.0053 ( 0.1%) 0.1939 ( 0.2%) 0.1938 ( 0.2%) MachineDominator Tree Construction #6 0.1877 ( 0.2%) 0.0062 ( 0.1%) 0.1939 ( 0.2%) 0.1938 ( 0.2%) MachineDominator Tree Construction #7 0.1858 ( 0.2%) 0.0078 ( 0.2%) 0.1936 ( 0.2%) 0.1935 ( 0.2%) Tail Call Elimination 0.1820 ( 0.2%) 0.0112 ( 0.2%) 0.1932 ( 0.2%) 0.1929 ( 0.2%) Dominator Tree Construction #22 0.1867 ( 0.2%) 0.0049 ( 0.1%) 0.1916 ( 0.2%) 0.1915 ( 0.2%) MachinePostDominator Tree Construction #3 0.1773 ( 0.2%) 0.0064 ( 0.1%) 0.1837 ( 0.2%) 0.1836 ( 0.2%) Dominator Tree Construction #19 0.1745 ( 0.2%) 0.0092 ( 0.2%) 0.1838 ( 0.2%) 0.1828 ( 0.2%) Dominator Tree Construction #5 0.1763 ( 0.2%) 0.0056 ( 0.1%) 0.1819 ( 0.2%) 0.1817 ( 0.2%) Dominator Tree Construction #14 0.1758 ( 0.2%) 0.0041 ( 0.1%) 0.1799 ( 0.2%) 0.1797 ( 0.2%) MachineDominator Tree Construction #8 0.1795 ( 0.2%) 0.0000 ( 0.0%) 0.1795 ( 0.2%) 0.1796 ( 0.2%) CallGraph Construction #2 0.1667 ( 0.2%) 0.0121 ( 0.3%) 0.1787 ( 0.2%) 0.1789 ( 0.2%) MachineDominator Tree Construction #2 0.1721 ( 0.2%) 0.0056 ( 0.1%) 0.1777 ( 0.2%) 0.1777 ( 0.2%) Dominator Tree Construction #18 0.1671 ( 0.2%) 0.0102 ( 0.2%) 0.1773 ( 0.2%) 0.1772 ( 0.2%) Block Frequency Analysis #2 0.1669 ( 0.2%) 0.0055 ( 0.1%) 0.1724 ( 0.2%) 0.1723 ( 0.2%) MachineDominator Tree Construction #5 0.1584 ( 0.2%) 0.0094 ( 0.2%) 0.1678 ( 0.2%) 0.1676 ( 0.2%) Insert stack protectors 0.1668 ( 0.2%) 0.0000 ( 0.0%) 0.1668 ( 0.2%) 0.1668 ( 0.2%) CallGraph Construction #3 0.1578 ( 0.2%) 0.0042 ( 0.1%) 0.1620 ( 0.2%) 0.1619 ( 0.2%) Two-Address instruction pass 0.1521 ( 0.2%) 0.0075 ( 0.2%) 0.1596 ( 0.2%) 0.1595 ( 0.2%) Dominator Tree Construction #13 0.1540 ( 0.2%) 0.0055 ( 0.1%) 0.1596 ( 0.2%) 0.1594 ( 0.2%) Dominator Tree Construction #20 0.1470 ( 0.1%) 0.0100 ( 0.2%) 0.1570 ( 0.1%) 0.1570 ( 0.1%) MachineDominator Tree Construction #3 0.1456 ( 0.1%) 0.0087 ( 0.2%) 0.1543 ( 0.1%) 0.1542 ( 0.1%) MachineDominator Tree Construction #4 0.1259 ( 0.1%) 0.0246 ( 0.5%) 0.1505 ( 0.1%) 0.1504 ( 0.1%) Dominator Tree Construction 0.1349 ( 0.1%) 0.0105 ( 0.2%) 0.1454 ( 0.1%) 0.1453 ( 0.1%) Remove dead machine instructions 0.1396 ( 0.1%) 0.0049 ( 0.1%) 0.1445 ( 0.1%) 0.1445 ( 0.1%) MemCpy Optimization 0.1322 ( 0.1%) 0.0090 ( 0.2%) 0.1412 ( 0.1%) 0.1413 ( 0.1%) CallGraph Construction 0.1340 ( 0.1%) 0.0028 ( 0.1%) 0.1368 ( 0.1%) 0.1370 ( 0.1%) Promote 'by reference' arguments to scalars 0.1257 ( 0.1%) 0.0064 ( 0.1%) 0.1322 ( 0.1%) 0.1322 ( 0.1%) Block Frequency Analysis #3 0.1265 ( 0.1%) 0.0033 ( 0.1%) 0.1298 ( 0.1%) 0.1301 ( 0.1%) Expand Atomic instructions 0.1242 ( 0.1%) 0.0040 ( 0.1%) 0.1282 ( 0.1%) 0.1282 ( 0.1%) Block Frequency Analysis #6 0.1211 ( 0.1%) 0.0036 ( 0.1%) 0.1246 ( 0.1%) 0.1245 ( 0.1%) Remove dead machine instructions #2 0.1164 ( 0.1%) 0.0019 ( 0.0%) 0.1183 ( 0.1%) 0.1183 ( 0.1%) Machine Block Frequency Analysis #2 0.1084 ( 0.1%) 0.0084 ( 0.2%) 0.1168 ( 0.1%) 0.1170 ( 0.1%) Remove unused exception handling info 0.1105 ( 0.1%) 0.0048 ( 0.1%) 0.1153 ( 0.1%) 0.1152 ( 0.1%) Natural Loop Information #4 0.1092 ( 0.1%) 0.0058 ( 0.1%) 0.1150 ( 0.1%) 0.1149 ( 0.1%) Natural Loop Information #3 0.1061 ( 0.1%) 0.0072 ( 0.2%) 0.1134 ( 0.1%) 0.1133 ( 0.1%) Natural Loop Information #7 0.1097 ( 0.1%) 0.0035 ( 0.1%) 0.1132 ( 0.1%) 0.1131 ( 0.1%) Block Frequency Analysis #4 0.1047 ( 0.1%) 0.0053 ( 0.1%) 0.1100 ( 0.1%) 0.1101 ( 0.1%) Natural Loop Information #8 0.1034 ( 0.1%) 0.0066 ( 0.1%) 0.1100 ( 0.1%) 0.1100 ( 0.1%) Machine Block Frequency Analysis 0.1083 ( 0.1%) 0.0020 ( 0.0%) 0.1102 ( 0.1%) 0.1100 ( 0.1%) Loop Invariant Code Motion #3 0.1082 ( 0.1%) 0.0018 ( 0.0%) 0.1100 ( 0.1%) 0.1098 ( 0.1%) Loop Invariant Code Motion #2 0.1052 ( 0.1%) 0.0040 ( 0.1%) 0.1092 ( 0.1%) 0.1091 ( 0.1%) Loop Invariant Code Motion 0.1004 ( 0.1%) 0.0035 ( 0.1%) 0.1038 ( 0.1%) 0.1037 ( 0.1%) X86 LEA Optimize 0.1009 ( 0.1%) 0.0028 ( 0.1%) 0.1037 ( 0.1%) 0.1036 ( 0.1%) Natural Loop Information #14 0.0978 ( 0.1%) 0.0052 ( 0.1%) 0.1030 ( 0.1%) 0.1027 ( 0.1%) Natural Loop Information #6 0.0968 ( 0.1%) 0.0057 ( 0.1%) 0.1024 ( 0.1%) 0.1024 ( 0.1%) Natural Loop Information #9 0.0969 ( 0.1%) 0.0051 ( 0.1%) 0.1021 ( 0.1%) 0.1022 ( 0.1%) Float to int 0.0992 ( 0.1%) 0.0026 ( 0.1%) 0.1018 ( 0.1%) 0.1016 ( 0.1%) Machine Block Frequency Analysis #4 0.0968 ( 0.1%) 0.0048 ( 0.1%) 0.1016 ( 0.1%) 0.1016 ( 0.1%) Natural Loop Information #17 0.0973 ( 0.1%) 0.0032 ( 0.1%) 0.1006 ( 0.1%) 0.1006 ( 0.1%) Machine Block Frequency Analysis #3 0.0935 ( 0.1%) 0.0030 ( 0.1%) 0.0964 ( 0.1%) 0.0961 ( 0.1%) Natural Loop Information #11 0.0918 ( 0.1%) 0.0044 ( 0.1%) 0.0962 ( 0.1%) 0.0960 ( 0.1%) Natural Loop Information #10 0.0938 ( 0.1%) 0.0019 ( 0.0%) 0.0957 ( 0.1%) 0.0959 ( 0.1%) Dominator Tree Construction #4 0.0936 ( 0.1%) 0.0022 ( 0.0%) 0.0958 ( 0.1%) 0.0958 ( 0.1%) Natural Loop Information #13 0.0920 ( 0.1%) 0.0036 ( 0.1%) 0.0956 ( 0.1%) 0.0955 ( 0.1%) Natural Loop Information #12 0.0883 ( 0.1%) 0.0043 ( 0.1%) 0.0926 ( 0.1%) 0.0925 ( 0.1%) Natural Loop Information #5 0.0896 ( 0.1%) 0.0000 ( 0.0%) 0.0896 ( 0.1%) 0.0897 ( 0.1%) Dead Global Elimination #2 0.0889 ( 0.1%) 0.0000 ( 0.0%) 0.0889 ( 0.1%) 0.0890 ( 0.1%) Dead Global Elimination 0.0857 ( 0.1%) 0.0025 ( 0.1%) 0.0882 ( 0.1%) 0.0880 ( 0.1%) Slot index numbering #2 0.0766 ( 0.1%) 0.0077 ( 0.2%) 0.0844 ( 0.1%) 0.0841 ( 0.1%) Slot index numbering 0.0822 ( 0.1%) 0.0018 ( 0.0%) 0.0840 ( 0.1%) 0.0839 ( 0.1%) Block Frequency Analysis 0.0811 ( 0.1%) 0.0026 ( 0.1%) 0.0837 ( 0.1%) 0.0836 ( 0.1%) Block Frequency Analysis #5 0.0788 ( 0.1%) 0.0045 ( 0.1%) 0.0833 ( 0.1%) 0.0832 ( 0.1%) Natural Loop Information #2 0.0774 ( 0.1%) 0.0027 ( 0.1%) 0.0801 ( 0.1%) 0.0800 ( 0.1%) Machine Natural Loop Construction #5 0.0777 ( 0.1%) 0.0000 ( 0.0%) 0.0777 ( 0.1%) 0.0777 ( 0.1%) Dead Argument Elimination 0.0724 ( 0.1%) 0.0051 ( 0.1%) 0.0774 ( 0.1%) 0.0774 ( 0.1%) Machine InstCombiner 0.0737 ( 0.1%) 0.0028 ( 0.1%) 0.0764 ( 0.1%) 0.0764 ( 0.1%) Natural Loop Information #16 0.0639 ( 0.1%) 0.0125 ( 0.3%) 0.0764 ( 0.1%) 0.0761 ( 0.1%) Deduce function attributes 0.0740 ( 0.1%) 0.0018 ( 0.0%) 0.0758 ( 0.1%) 0.0757 ( 0.1%) Machine Natural Loop Construction #4 0.0672 ( 0.1%) 0.0070 ( 0.2%) 0.0742 ( 0.1%) 0.0741 ( 0.1%) Early Tail Duplication 0.0718 ( 0.1%) 0.0024 ( 0.1%) 0.0742 ( 0.1%) 0.0741 ( 0.1%) Natural Loop Information #15 0.0679 ( 0.1%) 0.0049 ( 0.1%) 0.0729 ( 0.1%) 0.0728 ( 0.1%) Machine Natural Loop Construction 0.0695 ( 0.1%) 0.0021 ( 0.0%) 0.0716 ( 0.1%) 0.0715 ( 0.1%) Machine Natural Loop Construction #3 0.0662 ( 0.1%) 0.0034 ( 0.1%) 0.0696 ( 0.1%) 0.0696 ( 0.1%) Lower constant intrinsics 0.0633 ( 0.1%) 0.0013 ( 0.0%) 0.0646 ( 0.1%) 0.0646 ( 0.1%) Natural Loop Information 0.0603 ( 0.1%) 0.0027 ( 0.1%) 0.0630 ( 0.1%) 0.0629 ( 0.1%) Unroll loops 0.0598 ( 0.1%) 0.0020 ( 0.0%) 0.0618 ( 0.1%) 0.0616 ( 0.1%) Lower constant intrinsics #2 0.0548 ( 0.1%) 0.0061 ( 0.1%) 0.0609 ( 0.1%) 0.0604 ( 0.1%) Scalar Evolution Analysis #2 0.0545 ( 0.1%) 0.0038 ( 0.1%) 0.0583 ( 0.1%) 0.0582 ( 0.1%) Machine Natural Loop Construction #2 0.0548 ( 0.1%) 0.0029 ( 0.1%) 0.0577 ( 0.1%) 0.0578 ( 0.1%) Loop Vectorization 0.0543 ( 0.1%) 0.0032 ( 0.1%) 0.0575 ( 0.1%) 0.0573 ( 0.1%) Combine pattern based expressions 0.0506 ( 0.1%) 0.0066 ( 0.1%) 0.0572 ( 0.1%) 0.0572 ( 0.1%) Call-site splitting 0.0546 ( 0.1%) 0.0017 ( 0.0%) 0.0563 ( 0.1%) 0.0562 ( 0.1%) Expand memcmp() to load/stores 0.0517 ( 0.1%) 0.0021 ( 0.0%) 0.0538 ( 0.1%) 0.0539 ( 0.1%) Unswitch loops 0.0512 ( 0.1%) 0.0022 ( 0.0%) 0.0534 ( 0.1%) 0.0533 ( 0.1%) Remove unreachable machine basic blocks 0.0508 ( 0.1%) 0.0025 ( 0.1%) 0.0533 ( 0.1%) 0.0532 ( 0.1%) Free MachineFunction 0.0468 ( 0.0%) 0.0032 ( 0.1%) 0.0500 ( 0.0%) 0.0500 ( 0.0%) Early Machine Loop Invariant Code Motion 0.0474 ( 0.0%) 0.0018 ( 0.0%) 0.0491 ( 0.0%) 0.0492 ( 0.0%) X86 Optimize Call Frame 0.0470 ( 0.0%) 0.0017 ( 0.0%) 0.0487 ( 0.0%) 0.0485 ( 0.0%) X86 EFLAGS copy lowering 0.0469 ( 0.0%) 0.0016 ( 0.0%) 0.0485 ( 0.0%) 0.0484 ( 0.0%) X86 Fixup SetCC 0.0457 ( 0.0%) 0.0016 ( 0.0%) 0.0473 ( 0.0%) 0.0471 ( 0.0%) PostRA Machine Sink 0.0425 ( 0.0%) 0.0033 ( 0.1%) 0.0457 ( 0.0%) 0.0457 ( 0.0%) Canonicalize natural loops 0.0390 ( 0.0%) 0.0028 ( 0.1%) 0.0418 ( 0.0%) 0.0417 ( 0.0%) Lazy Value Information Analysis #2 0.0403 ( 0.0%) 0.0014 ( 0.0%) 0.0417 ( 0.0%) 0.0417 ( 0.0%) Remove unreachable blocks from the CFG 0.0348 ( 0.0%) 0.0060 ( 0.1%) 0.0409 ( 0.0%) 0.0403 ( 0.0%) Scalar Evolution Analysis 0.0370 ( 0.0%) 0.0033 ( 0.1%) 0.0403 ( 0.0%) 0.0402 ( 0.0%) Canonicalize natural loops #2 0.0391 ( 0.0%) 0.0011 ( 0.0%) 0.0402 ( 0.0%) 0.0400 ( 0.0%) Scalar Evolution Analysis #11 0.0360 ( 0.0%) 0.0038 ( 0.1%) 0.0398 ( 0.0%) 0.0397 ( 0.0%) Finalize ISel and expand pseudo-instructions 0.0365 ( 0.0%) 0.0014 ( 0.0%) 0.0379 ( 0.0%) 0.0380 ( 0.0%) BreakFalseDeps 0.0364 ( 0.0%) 0.0012 ( 0.0%) 0.0376 ( 0.0%) 0.0376 ( 0.0%) X86 LEA Fixup 0.0343 ( 0.0%) 0.0025 ( 0.1%) 0.0368 ( 0.0%) 0.0367 ( 0.0%) Lazy Value Information Analysis 0.0295 ( 0.0%) 0.0053 ( 0.1%) 0.0347 ( 0.0%) 0.0343 ( 0.0%) Scalar Evolution Analysis #3 0.0331 ( 0.0%) 0.0010 ( 0.0%) 0.0341 ( 0.0%) 0.0341 ( 0.0%) Post-RA pseudo instruction expansion pass 0.0334 ( 0.0%) 0.0008 ( 0.0%) 0.0342 ( 0.0%) 0.0341 ( 0.0%) Canonicalize natural loops #6 0.0322 ( 0.0%) 0.0011 ( 0.0%) 0.0333 ( 0.0%) 0.0332 ( 0.0%) Partially inline calls to library functions 0.0308 ( 0.0%) 0.0023 ( 0.0%) 0.0330 ( 0.0%) 0.0329 ( 0.0%) Conditionally eliminate dead library calls 0.0315 ( 0.0%) 0.0013 ( 0.0%) 0.0328 ( 0.0%) 0.0327 ( 0.0%) Check CFA info and insert CFI instructions if needed 0.0318 ( 0.0%) 0.0008 ( 0.0%) 0.0326 ( 0.0%) 0.0325 ( 0.0%) Scalar Evolution Analysis #8 0.0307 ( 0.0%) 0.0007 ( 0.0%) 0.0314 ( 0.0%) 0.0313 ( 0.0%) Canonicalize natural loops #9 0.0297 ( 0.0%) 0.0007 ( 0.0%) 0.0304 ( 0.0%) 0.0307 ( 0.0%) Globals Alias Analysis 0.0274 ( 0.0%) 0.0018 ( 0.0%) 0.0291 ( 0.0%) 0.0291 ( 0.0%) Exception handling preparation 0.0248 ( 0.0%) 0.0038 ( 0.1%) 0.0286 ( 0.0%) 0.0286 ( 0.0%) Function Alias Analysis Results #2 0.0270 ( 0.0%) 0.0017 ( 0.0%) 0.0287 ( 0.0%) 0.0285 ( 0.0%) Scalar Evolution Analysis #6 0.0263 ( 0.0%) 0.0019 ( 0.0%) 0.0282 ( 0.0%) 0.0282 ( 0.0%) X86 cmov Conversion 0.0269 ( 0.0%) 0.0013 ( 0.0%) 0.0282 ( 0.0%) 0.0282 ( 0.0%) Canonicalize natural loops #4 0.0268 ( 0.0%) 0.0011 ( 0.0%) 0.0279 ( 0.0%) 0.0280 ( 0.0%) Recognize loop idioms 0.0256 ( 0.0%) 0.0023 ( 0.0%) 0.0278 ( 0.0%) 0.0278 ( 0.0%) PGOMemOPSize 0.0241 ( 0.0%) 0.0027 ( 0.1%) 0.0267 ( 0.0%) 0.0267 ( 0.0%) Canonicalize natural loops #3 0.0254 ( 0.0%) 0.0012 ( 0.0%) 0.0266 ( 0.0%) 0.0266 ( 0.0%) Rotate Loops 0.0226 ( 0.0%) 0.0037 ( 0.1%) 0.0263 ( 0.0%) 0.0263 ( 0.0%) Function Alias Analysis Results #9 0.0227 ( 0.0%) 0.0035 ( 0.1%) 0.0262 ( 0.0%) 0.0262 ( 0.0%) Function Alias Analysis Results #3 0.0227 ( 0.0%) 0.0035 ( 0.1%) 0.0262 ( 0.0%) 0.0261 ( 0.0%) Function Alias Analysis Results #4 0.0226 ( 0.0%) 0.0035 ( 0.1%) 0.0261 ( 0.0%) 0.0260 ( 0.0%) Function Alias Analysis Results #5 0.0218 ( 0.0%) 0.0041 ( 0.1%) 0.0259 ( 0.0%) 0.0259 ( 0.0%) Function Alias Analysis Results #13 0.0249 ( 0.0%) 0.0008 ( 0.0%) 0.0257 ( 0.0%) 0.0257 ( 0.0%) Machine Loop Invariant Code Motion 0.0218 ( 0.0%) 0.0038 ( 0.1%) 0.0256 ( 0.0%) 0.0256 ( 0.0%) Function Alias Analysis Results #8 0.0241 ( 0.0%) 0.0010 ( 0.0%) 0.0250 ( 0.0%) 0.0250 ( 0.0%) Tail Duplication 0.0212 ( 0.0%) 0.0038 ( 0.1%) 0.0250 ( 0.0%) 0.0249 ( 0.0%) Function Alias Analysis Results #7 0.0236 ( 0.0%) 0.0009 ( 0.0%) 0.0245 ( 0.0%) 0.0245 ( 0.0%) Interleaved Access Pass 0.0208 ( 0.0%) 0.0037 ( 0.1%) 0.0245 ( 0.0%) 0.0245 ( 0.0%) Function Alias Analysis Results #6 0.0207 ( 0.0%) 0.0037 ( 0.1%) 0.0245 ( 0.0%) 0.0244 ( 0.0%) Function Alias Analysis Results #11 0.0223 ( 0.0%) 0.0011 ( 0.0%) 0.0235 ( 0.0%) 0.0235 ( 0.0%) Rotate Loops #2 0.0196 ( 0.0%) 0.0035 ( 0.1%) 0.0231 ( 0.0%) 0.0231 ( 0.0%) Function Alias Analysis Results #10 0.0211 ( 0.0%) 0.0008 ( 0.0%) 0.0218 ( 0.0%) 0.0216 ( 0.0%) Scalar Evolution Analysis #7 0.0180 ( 0.0%) 0.0035 ( 0.1%) 0.0215 ( 0.0%) 0.0215 ( 0.0%) Function Alias Analysis Results #12 0.0188 ( 0.0%) 0.0025 ( 0.1%) 0.0213 ( 0.0%) 0.0213 ( 0.0%) Basic Alias Analysis (stateless AA impl) #2 0.0206 ( 0.0%) 0.0007 ( 0.0%) 0.0213 ( 0.0%) 0.0213 ( 0.0%) X86 pseudo instruction expansion pass 0.0185 ( 0.0%) 0.0024 ( 0.1%) 0.0209 ( 0.0%) 0.0207 ( 0.0%) Memory Dependence Analysis 0.0185 ( 0.0%) 0.0020 ( 0.0%) 0.0205 ( 0.0%) 0.0206 ( 0.0%) Loop-Closed SSA Form Pass 0.0189 ( 0.0%) 0.0016 ( 0.0%) 0.0204 ( 0.0%) 0.0205 ( 0.0%) Function Alias Analysis Results 0.0184 ( 0.0%) 0.0019 ( 0.0%) 0.0203 ( 0.0%) 0.0205 ( 0.0%) Loop-Closed SSA Form Pass #3 0.0198 ( 0.0%) 0.0007 ( 0.0%) 0.0205 ( 0.0%) 0.0204 ( 0.0%) Scalarize Masked Memory Intrinsics 0.0194 ( 0.0%) 0.0008 ( 0.0%) 0.0202 ( 0.0%) 0.0201 ( 0.0%) Expand reduction intrinsics 0.0177 ( 0.0%) 0.0021 ( 0.0%) 0.0198 ( 0.0%) 0.0199 ( 0.0%) MergedLoadStoreMotion 0.0187 ( 0.0%) 0.0008 ( 0.0%) 0.0195 ( 0.0%) 0.0195 ( 0.0%) Hoist/decompose integer division and remainder 0.0185 ( 0.0%) 0.0007 ( 0.0%) 0.0193 ( 0.0%) 0.0193 ( 0.0%) X86 Avoid Store Forwarding Blocks 0.0167 ( 0.0%) 0.0019 ( 0.0%) 0.0186 ( 0.0%) 0.0187 ( 0.0%) Loop-Closed SSA Form Pass #2 0.0177 ( 0.0%) 0.0004 ( 0.0%) 0.0180 ( 0.0%) 0.0180 ( 0.0%) Debug Variable Analysis 0.0152 ( 0.0%) 0.0025 ( 0.1%) 0.0177 ( 0.0%) 0.0178 ( 0.0%) Lower 'expect' Intrinsics 0.0168 ( 0.0%) 0.0004 ( 0.0%) 0.0172 ( 0.0%) 0.0172 ( 0.0%) Canonicalize natural loops #7 0.0149 ( 0.0%) 0.0023 ( 0.0%) 0.0172 ( 0.0%) 0.0171 ( 0.0%) Basic Alias Analysis (stateless AA impl) #5 0.0165 ( 0.0%) 0.0006 ( 0.0%) 0.0171 ( 0.0%) 0.0171 ( 0.0%) Bundle Machine CFG Edges #2 0.0143 ( 0.0%) 0.0027 ( 0.1%) 0.0170 ( 0.0%) 0.0170 ( 0.0%) Basic Alias Analysis (stateless AA impl) #13 0.0147 ( 0.0%) 0.0022 ( 0.0%) 0.0168 ( 0.0%) 0.0170 ( 0.0%) Basic Alias Analysis (stateless AA impl) #3 0.0143 ( 0.0%) 0.0025 ( 0.1%) 0.0168 ( 0.0%) 0.0168 ( 0.0%) Basic Alias Analysis (stateless AA impl) #11 0.0144 ( 0.0%) 0.0022 ( 0.0%) 0.0166 ( 0.0%) 0.0167 ( 0.0%) Basic Alias Analysis (stateless AA impl) #4 0.0159 ( 0.0%) 0.0007 ( 0.0%) 0.0165 ( 0.0%) 0.0167 ( 0.0%) Delete dead loops 0.0163 ( 0.0%) 0.0004 ( 0.0%) 0.0167 ( 0.0%) 0.0166 ( 0.0%) Bundle Machine CFG Edges 0.0140 ( 0.0%) 0.0026 ( 0.1%) 0.0166 ( 0.0%) 0.0166 ( 0.0%) Basic Alias Analysis (stateless AA impl) #7 0.0159 ( 0.0%) 0.0007 ( 0.0%) 0.0166 ( 0.0%) 0.0166 ( 0.0%) Process Implicit Definitions 0.0143 ( 0.0%) 0.0021 ( 0.0%) 0.0164 ( 0.0%) 0.0165 ( 0.0%) Speculatively execute instructions if target has divergent branches 0.0157 ( 0.0%) 0.0007 ( 0.0%) 0.0165 ( 0.0%) 0.0164 ( 0.0%) Stack Slot Coloring 0.0142 ( 0.0%) 0.0012 ( 0.0%) 0.0154 ( 0.0%) 0.0154 ( 0.0%) Basic Alias Analysis (stateless AA impl) 0.0146 ( 0.0%) 0.0004 ( 0.0%) 0.0151 ( 0.0%) 0.0150 ( 0.0%) Merge contiguous icmps into a memcmp 0.0131 ( 0.0%) 0.0019 ( 0.0%) 0.0149 ( 0.0%) 0.0149 ( 0.0%) Phi Values Analysis 0.0121 ( 0.0%) 0.0024 ( 0.1%) 0.0145 ( 0.0%) 0.0147 ( 0.0%) Basic Alias Analysis (stateless AA impl) #12 0.0142 ( 0.0%) 0.0006 ( 0.0%) 0.0147 ( 0.0%) 0.0147 ( 0.0%) Scalar Evolution Analysis #9 0.0138 ( 0.0%) 0.0011 ( 0.0%) 0.0148 ( 0.0%) 0.0146 ( 0.0%) Promote Memory to Register 0.0121 ( 0.0%) 0.0021 ( 0.0%) 0.0142 ( 0.0%) 0.0142 ( 0.0%) Basic Alias Analysis (stateless AA impl) #6 0.0137 ( 0.0%) 0.0005 ( 0.0%) 0.0142 ( 0.0%) 0.0142 ( 0.0%) Canonicalize natural loops #8 0.0121 ( 0.0%) 0.0021 ( 0.0%) 0.0142 ( 0.0%) 0.0140 ( 0.0%) Memory Dependence Analysis #3 0.0119 ( 0.0%) 0.0022 ( 0.0%) 0.0140 ( 0.0%) 0.0140 ( 0.0%) Basic Alias Analysis (stateless AA impl) #10 0.0106 ( 0.0%) 0.0028 ( 0.1%) 0.0134 ( 0.0%) 0.0133 ( 0.0%) Basic Alias Analysis (stateless AA impl) #22 0.0112 ( 0.0%) 0.0019 ( 0.0%) 0.0130 ( 0.0%) 0.0130 ( 0.0%) Basic Alias Analysis (stateless AA impl) #9 0.0128 ( 0.0%) 0.0000 ( 0.0%) 0.0128 ( 0.0%) 0.0128 ( 0.0%) Globals Alias Analysis #2 0.0123 ( 0.0%) 0.0003 ( 0.0%) 0.0126 ( 0.0%) 0.0126 ( 0.0%) Loop-Closed SSA Form Pass #6 0.0122 ( 0.0%) 0.0005 ( 0.0%) 0.0127 ( 0.0%) 0.0126 ( 0.0%) Scalar Evolution Analysis #10 0.0106 ( 0.0%) 0.0019 ( 0.0%) 0.0125 ( 0.0%) 0.0125 ( 0.0%) Memory Dependence Analysis #2 0.0118 ( 0.0%) 0.0008 ( 0.0%) 0.0125 ( 0.0%) 0.0125 ( 0.0%) Function Alias Analysis Results #22 0.0120 ( 0.0%) 0.0004 ( 0.0%) 0.0124 ( 0.0%) 0.0124 ( 0.0%) Loop-Closed SSA Form Pass #5 0.0102 ( 0.0%) 0.0020 ( 0.0%) 0.0122 ( 0.0%) 0.0123 ( 0.0%) Basic Alias Analysis (stateless AA impl) #8 0.0105 ( 0.0%) 0.0018 ( 0.0%) 0.0123 ( 0.0%) 0.0123 ( 0.0%) Demanded bits analysis 0.0110 ( 0.0%) 0.0012 ( 0.0%) 0.0121 ( 0.0%) 0.0121 ( 0.0%) Optimize machine instruction PHIs 0.0115 ( 0.0%) 0.0004 ( 0.0%) 0.0119 ( 0.0%) 0.0119 ( 0.0%) Loop-Closed SSA Form Pass #7 0.0112 ( 0.0%) 0.0006 ( 0.0%) 0.0118 ( 0.0%) 0.0118 ( 0.0%) Loop-Closed SSA Form Pass #4 0.0100 ( 0.0%) 0.0018 ( 0.0%) 0.0117 ( 0.0%) 0.0117 ( 0.0%) Lazy Branch Probability Analysis #2 0.0107 ( 0.0%) 0.0006 ( 0.0%) 0.0113 ( 0.0%) 0.0114 ( 0.0%) Function Alias Analysis Results #14 0.0108 ( 0.0%) 0.0004 ( 0.0%) 0.0112 ( 0.0%) 0.0112 ( 0.0%) Loop Sink 0.0093 ( 0.0%) 0.0019 ( 0.0%) 0.0113 ( 0.0%) 0.0112 ( 0.0%) Lazy Branch Probability Analysis #8 0.0106 ( 0.0%) 0.0006 ( 0.0%) 0.0112 ( 0.0%) 0.0111 ( 0.0%) Scalar Evolution Analysis #4 0.0105 ( 0.0%) 0.0004 ( 0.0%) 0.0109 ( 0.0%) 0.0109 ( 0.0%) Warn about non-applied transformations 0.0092 ( 0.0%) 0.0016 ( 0.0%) 0.0108 ( 0.0%) 0.0109 ( 0.0%) Lazy Branch Probability Analysis #3 0.0091 ( 0.0%) 0.0017 ( 0.0%) 0.0108 ( 0.0%) 0.0108 ( 0.0%) Lazy Branch Probability Analysis #5 0.0102 ( 0.0%) 0.0007 ( 0.0%) 0.0109 ( 0.0%) 0.0107 ( 0.0%) Scalar Evolution Analysis #5 0.0089 ( 0.0%) 0.0018 ( 0.0%) 0.0107 ( 0.0%) 0.0106 ( 0.0%) Lazy Branch Probability Analysis #6 0.0102 ( 0.0%) 0.0003 ( 0.0%) 0.0105 ( 0.0%) 0.0104 ( 0.0%) Spill Code Placement Analysis 0.0101 ( 0.0%) 0.0003 ( 0.0%) 0.0104 ( 0.0%) 0.0104 ( 0.0%) Function Alias Analysis Results #21 0.0086 ( 0.0%) 0.0017 ( 0.0%) 0.0104 ( 0.0%) 0.0103 ( 0.0%) Lazy Branch Probability Analysis #7 0.0098 ( 0.0%) 0.0005 ( 0.0%) 0.0103 ( 0.0%) 0.0102 ( 0.0%) Function Alias Analysis Results #18 0.0087 ( 0.0%) 0.0016 ( 0.0%) 0.0103 ( 0.0%) 0.0102 ( 0.0%) Lazy Branch Probability Analysis #4 0.0093 ( 0.0%) 0.0006 ( 0.0%) 0.0099 ( 0.0%) 0.0099 ( 0.0%) Function Alias Analysis Results #15 0.0094 ( 0.0%) 0.0004 ( 0.0%) 0.0098 ( 0.0%) 0.0098 ( 0.0%) Function Alias Analysis Results #19 0.0090 ( 0.0%) 0.0008 ( 0.0%) 0.0098 ( 0.0%) 0.0098 ( 0.0%) Lazy Branch Probability Analysis 0.0079 ( 0.0%) 0.0017 ( 0.0%) 0.0095 ( 0.0%) 0.0097 ( 0.0%) Instrument function entry/exit with calls to e.g. mcount() (pre inlining) 0.0081 ( 0.0%) 0.0015 ( 0.0%) 0.0096 ( 0.0%) 0.0096 ( 0.0%) Phi Values Analysis #2 0.0078 ( 0.0%) 0.0016 ( 0.0%) 0.0094 ( 0.0%) 0.0096 ( 0.0%) Phi Values Analysis #3 0.0092 ( 0.0%) 0.0003 ( 0.0%) 0.0095 ( 0.0%) 0.0095 ( 0.0%) Live Register Matrix 0.0078 ( 0.0%) 0.0014 ( 0.0%) 0.0093 ( 0.0%) 0.0093 ( 0.0%) Optimization Remark Emitter #3 0.0087 ( 0.0%) 0.0006 ( 0.0%) 0.0093 ( 0.0%) 0.0093 ( 0.0%) Canonicalize natural loops #5 0.0077 ( 0.0%) 0.0014 ( 0.0%) 0.0091 ( 0.0%) 0.0092 ( 0.0%) Optimization Remark Emitter #2 0.0075 ( 0.0%) 0.0015 ( 0.0%) 0.0090 ( 0.0%) 0.0091 ( 0.0%) Optimization Remark Emitter #6 0.0077 ( 0.0%) 0.0014 ( 0.0%) 0.0090 ( 0.0%) 0.0090 ( 0.0%) Lazy Block Frequency Analysis #2 0.0073 ( 0.0%) 0.0016 ( 0.0%) 0.0089 ( 0.0%) 0.0089 ( 0.0%) Optimization Remark Emitter #7 0.0073 ( 0.0%) 0.0016 ( 0.0%) 0.0089 ( 0.0%) 0.0089 ( 0.0%) Optimization Remark Emitter #8 0.0083 ( 0.0%) 0.0006 ( 0.0%) 0.0088 ( 0.0%) 0.0089 ( 0.0%) Function Alias Analysis Results #17 0.0071 ( 0.0%) 0.0015 ( 0.0%) 0.0086 ( 0.0%) 0.0088 ( 0.0%) Optimization Remark Emitter #5 0.0071 ( 0.0%) 0.0014 ( 0.0%) 0.0086 ( 0.0%) 0.0087 ( 0.0%) Optimization Remark Emitter #4 0.0072 ( 0.0%) 0.0015 ( 0.0%) 0.0087 ( 0.0%) 0.0087 ( 0.0%) Lazy Block Frequency Analysis #8 0.0068 ( 0.0%) 0.0014 ( 0.0%) 0.0082 ( 0.0%) 0.0084 ( 0.0%) Lazy Block Frequency Analysis #5 0.0079 ( 0.0%) 0.0006 ( 0.0%) 0.0084 ( 0.0%) 0.0084 ( 0.0%) Function Alias Analysis Results #16 0.0070 ( 0.0%) 0.0014 ( 0.0%) 0.0084 ( 0.0%) 0.0084 ( 0.0%) Lazy Block Frequency Analysis #7 0.0069 ( 0.0%) 0.0014 ( 0.0%) 0.0083 ( 0.0%) 0.0083 ( 0.0%) Lazy Block Frequency Analysis #6 0.0070 ( 0.0%) 0.0013 ( 0.0%) 0.0083 ( 0.0%) 0.0083 ( 0.0%) Lazy Block Frequency Analysis #3 0.0073 ( 0.0%) 0.0006 ( 0.0%) 0.0079 ( 0.0%) 0.0079 ( 0.0%) Optimization Remark Emitter 0.0064 ( 0.0%) 0.0013 ( 0.0%) 0.0078 ( 0.0%) 0.0078 ( 0.0%) Lazy Block Frequency Analysis #4 0.0064 ( 0.0%) 0.0013 ( 0.0%) 0.0077 ( 0.0%) 0.0078 ( 0.0%) LCSSA Verifier 0.0072 ( 0.0%) 0.0006 ( 0.0%) 0.0078 ( 0.0%) 0.0078 ( 0.0%) Lazy Block Frequency Analysis 0.0063 ( 0.0%) 0.0014 ( 0.0%) 0.0076 ( 0.0%) 0.0077 ( 0.0%) LCSSA Verifier #3 0.0062 ( 0.0%) 0.0013 ( 0.0%) 0.0075 ( 0.0%) 0.0077 ( 0.0%) LCSSA Verifier #2 0.0072 ( 0.0%) 0.0005 ( 0.0%) 0.0077 ( 0.0%) 0.0076 ( 0.0%) Loop Access Analysis #3 0.0069 ( 0.0%) 0.0004 ( 0.0%) 0.0073 ( 0.0%) 0.0073 ( 0.0%) Loop Distribution 0.0067 ( 0.0%) 0.0006 ( 0.0%) 0.0072 ( 0.0%) 0.0072 ( 0.0%) Machine Trace Metrics 0.0065 ( 0.0%) 0.0002 ( 0.0%) 0.0067 ( 0.0%) 0.0068 ( 0.0%) Basic Alias Analysis (stateless AA impl) #20 0.0066 ( 0.0%) 0.0000 ( 0.0%) 0.0066 ( 0.0%) 0.0066 ( 0.0%) Deduce function attributes in RPO 0.0057 ( 0.0%) 0.0006 ( 0.0%) 0.0064 ( 0.0%) 0.0062 ( 0.0%) Basic Alias Analysis (stateless AA impl) #19 0.0056 ( 0.0%) 0.0003 ( 0.0%) 0.0059 ( 0.0%) 0.0059 ( 0.0%) Basic Alias Analysis (stateless AA impl) #17 0.0055 ( 0.0%) 0.0003 ( 0.0%) 0.0058 ( 0.0%) 0.0058 ( 0.0%) Shrink Wrapping analysis 0.0055 ( 0.0%) 0.0002 ( 0.0%) 0.0058 ( 0.0%) 0.0058 ( 0.0%) Post RA top-down list latency scheduler 0.0056 ( 0.0%) 0.0002 ( 0.0%) 0.0058 ( 0.0%) 0.0058 ( 0.0%) Basic Alias Analysis (stateless AA impl) #21 0.0053 ( 0.0%) 0.0003 ( 0.0%) 0.0056 ( 0.0%) 0.0056 ( 0.0%) Basic Alias Analysis (stateless AA impl) #14 0.0049 ( 0.0%) 0.0004 ( 0.0%) 0.0053 ( 0.0%) 0.0054 ( 0.0%) Lazy Branch Probability Analysis #18 0.0050 ( 0.0%) 0.0003 ( 0.0%) 0.0053 ( 0.0%) 0.0053 ( 0.0%) Basic Alias Analysis (stateless AA impl) #16 0.0048 ( 0.0%) 0.0002 ( 0.0%) 0.0050 ( 0.0%) 0.0050 ( 0.0%) Basic Alias Analysis (stateless AA impl) #18 0.0046 ( 0.0%) 0.0002 ( 0.0%) 0.0049 ( 0.0%) 0.0049 ( 0.0%) X86 Indirect Branch Tracking 0.0046 ( 0.0%) 0.0002 ( 0.0%) 0.0048 ( 0.0%) 0.0048 ( 0.0%) Lazy Branch Probability Analysis #14 0.0045 ( 0.0%) 0.0003 ( 0.0%) 0.0048 ( 0.0%) 0.0048 ( 0.0%) Basic Alias Analysis (stateless AA impl) #15 0.0043 ( 0.0%) 0.0003 ( 0.0%) 0.0047 ( 0.0%) 0.0047 ( 0.0%) Function Alias Analysis Results #20 0.0044 ( 0.0%) 0.0002 ( 0.0%) 0.0045 ( 0.0%) 0.0045 ( 0.0%) Virtual Register Map 0.0043 ( 0.0%) 0.0002 ( 0.0%) 0.0045 ( 0.0%) 0.0045 ( 0.0%) Expand indirectbr instructions 0.0042 ( 0.0%) 0.0003 ( 0.0%) 0.0045 ( 0.0%) 0.0044 ( 0.0%) Loop Access Analysis 0.0042 ( 0.0%) 0.0002 ( 0.0%) 0.0044 ( 0.0%) 0.0044 ( 0.0%) Lazy Branch Probability Analysis #17 0.0042 ( 0.0%) 0.0002 ( 0.0%) 0.0044 ( 0.0%) 0.0044 ( 0.0%) Implement the 'patchable-function' attribute 0.0041 ( 0.0%) 0.0002 ( 0.0%) 0.0044 ( 0.0%) 0.0043 ( 0.0%) Insert fentry calls 0.0039 ( 0.0%) 0.0002 ( 0.0%) 0.0040 ( 0.0%) 0.0041 ( 0.0%) Lazy Branch Probability Analysis #15 0.0037 ( 0.0%) 0.0004 ( 0.0%) 0.0041 ( 0.0%) 0.0041 ( 0.0%) Local Stack Slot Allocation 0.0036 ( 0.0%) 0.0004 ( 0.0%) 0.0041 ( 0.0%) 0.0041 ( 0.0%) Local Dynamic TLS Access Clean-up 0.0037 ( 0.0%) 0.0001 ( 0.0%) 0.0039 ( 0.0%) 0.0038 ( 0.0%) Machine Optimization Remark Emitter 0.0036 ( 0.0%) 0.0002 ( 0.0%) 0.0038 ( 0.0%) 0.0038 ( 0.0%) Insert XRay ops 0.0035 ( 0.0%) 0.0002 ( 0.0%) 0.0037 ( 0.0%) 0.0038 ( 0.0%) Lazy Branch Probability Analysis #13 0.0035 ( 0.0%) 0.0002 ( 0.0%) 0.0037 ( 0.0%) 0.0037 ( 0.0%) Machine Optimization Remark Emitter #3 0.0035 ( 0.0%) 0.0002 ( 0.0%) 0.0037 ( 0.0%) 0.0037 ( 0.0%) Loop Access Analysis #2 0.0036 ( 0.0%) 0.0002 ( 0.0%) 0.0038 ( 0.0%) 0.0037 ( 0.0%) Alignment from assumptions 0.0035 ( 0.0%) 0.0002 ( 0.0%) 0.0037 ( 0.0%) 0.0037 ( 0.0%) Live Stack Slot Analysis 0.0035 ( 0.0%) 0.0002 ( 0.0%) 0.0037 ( 0.0%) 0.0036 ( 0.0%) Lazy Branch Probability Analysis #16 0.0033 ( 0.0%) 0.0003 ( 0.0%) 0.0036 ( 0.0%) 0.0036 ( 0.0%) Early If-Conversion 0.0034 ( 0.0%) 0.0002 ( 0.0%) 0.0036 ( 0.0%) 0.0036 ( 0.0%) Lazy Branch Probability Analysis #12 0.0034 ( 0.0%) 0.0002 ( 0.0%) 0.0035 ( 0.0%) 0.0035 ( 0.0%) Instrument function entry/exit with calls to e.g. mcount() (post inlining) 0.0033 ( 0.0%) 0.0002 ( 0.0%) 0.0035 ( 0.0%) 0.0034 ( 0.0%) Demanded bits analysis #3 0.0032 ( 0.0%) 0.0002 ( 0.0%) 0.0034 ( 0.0%) 0.0034 ( 0.0%) Machine Optimization Remark Emitter #2 0.0032 ( 0.0%) 0.0002 ( 0.0%) 0.0034 ( 0.0%) 0.0034 ( 0.0%) Contiguously Lay Out Funclets 0.0031 ( 0.0%) 0.0004 ( 0.0%) 0.0035 ( 0.0%) 0.0034 ( 0.0%) X86 Domain Reassignment Pass 0.0033 ( 0.0%) 0.0002 ( 0.0%) 0.0035 ( 0.0%) 0.0034 ( 0.0%) Lazy Branch Probability Analysis #10 0.0031 ( 0.0%) 0.0002 ( 0.0%) 0.0033 ( 0.0%) 0.0034 ( 0.0%) Lazy Branch Probability Analysis #9 0.0032 ( 0.0%) 0.0003 ( 0.0%) 0.0034 ( 0.0%) 0.0034 ( 0.0%) Lazy Block Frequency Analysis #18 0.0031 ( 0.0%) 0.0002 ( 0.0%) 0.0033 ( 0.0%) 0.0034 ( 0.0%) Lazy Branch Probability Analysis #11 0.0031 ( 0.0%) 0.0002 ( 0.0%) 0.0033 ( 0.0%) 0.0034 ( 0.0%) Demanded bits analysis #2 0.0031 ( 0.0%) 0.0002 ( 0.0%) 0.0033 ( 0.0%) 0.0033 ( 0.0%) Live DEBUG_VALUE analysis 0.0031 ( 0.0%) 0.0002 ( 0.0%) 0.0033 ( 0.0%) 0.0033 ( 0.0%) X86 Atom pad short functions 0.0031 ( 0.0%) 0.0001 ( 0.0%) 0.0032 ( 0.0%) 0.0033 ( 0.0%) Optimization Remark Emitter #14 0.0031 ( 0.0%) 0.0002 ( 0.0%) 0.0033 ( 0.0%) 0.0033 ( 0.0%) Lazy Machine Block Frequency Analysis #7 0.0030 ( 0.0%) 0.0002 ( 0.0%) 0.0032 ( 0.0%) 0.0032 ( 0.0%) StackMap Liveness Analysis 0.0032 ( 0.0%) 0.0001 ( 0.0%) 0.0033 ( 0.0%) 0.0032 ( 0.0%) Analyze Machine Code For Garbage Collection 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0032 ( 0.0%) X86 speculative load hardening 0.0029 ( 0.0%) 0.0003 ( 0.0%) 0.0032 ( 0.0%) 0.0032 ( 0.0%) Lazy Machine Block Frequency Analysis 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0032 ( 0.0%) 0.0032 ( 0.0%) X86 Retpoline Thunks 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0032 ( 0.0%) Optimization Remark Emitter #15 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0032 ( 0.0%) Optimization Remark Emitter #12 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0031 ( 0.0%) X86 vzeroupper inserter 0.0030 ( 0.0%) 0.0002 ( 0.0%) 0.0032 ( 0.0%) 0.0031 ( 0.0%) Lazy Machine Block Frequency Analysis #3 0.0031 ( 0.0%) 0.0001 ( 0.0%) 0.0032 ( 0.0%) 0.0031 ( 0.0%) Lazy Machine Block Frequency Analysis #4 0.0031 ( 0.0%) 0.0001 ( 0.0%) 0.0032 ( 0.0%) 0.0031 ( 0.0%) Lazy Block Frequency Analysis #17 0.0030 ( 0.0%) 0.0002 ( 0.0%) 0.0032 ( 0.0%) 0.0031 ( 0.0%) Lazy Machine Block Frequency Analysis #5 0.0030 ( 0.0%) 0.0001 ( 0.0%) 0.0031 ( 0.0%) 0.0031 ( 0.0%) Rename Disconnected Subregister Components 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0031 ( 0.0%) Optimization Remark Emitter #11 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0031 ( 0.0%) Lazy Machine Block Frequency Analysis #6 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0031 ( 0.0%) 0.0031 ( 0.0%) X86 WinAlloca Expander 0.0029 ( 0.0%) 0.0002 ( 0.0%) 0.0030 ( 0.0%) 0.0031 ( 0.0%) Lazy Block Frequency Analysis #13 0.0028 ( 0.0%) 0.0002 ( 0.0%) 0.0030 ( 0.0%) 0.0031 ( 0.0%) Optimization Remark Emitter #9 0.0028 ( 0.0%) 0.0002 ( 0.0%) 0.0030 ( 0.0%) 0.0030 ( 0.0%) X86 Discriminate Memory Operands 0.0028 ( 0.0%) 0.0001 ( 0.0%) 0.0030 ( 0.0%) 0.0030 ( 0.0%) Lazy Block Frequency Analysis #14 0.0028 ( 0.0%) 0.0001 ( 0.0%) 0.0030 ( 0.0%) 0.0030 ( 0.0%) Optimization Remark Emitter #13 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0030 ( 0.0%) X86 Insert Cache Prefetches 0.0026 ( 0.0%) 0.0003 ( 0.0%) 0.0029 ( 0.0%) 0.0030 ( 0.0%) X86 PIC Global Base Reg Initialization 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0030 ( 0.0%) Lazy Block Frequency Analysis #15 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0030 ( 0.0%) Lazy Machine Block Frequency Analysis #9 0.0027 ( 0.0%) 0.0003 ( 0.0%) 0.0029 ( 0.0%) 0.0029 ( 0.0%) Lazy Machine Block Frequency Analysis #2 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0029 ( 0.0%) Optimization Remark Emitter #10 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0029 ( 0.0%) Detect Dead Lanes 0.0028 ( 0.0%) 0.0002 ( 0.0%) 0.0030 ( 0.0%) 0.0029 ( 0.0%) Lazy Machine Block Frequency Analysis #8 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0029 ( 0.0%) Compressing EVEX instrs to VEX encoding when possible 0.0027 ( 0.0%) 0.0002 ( 0.0%) 0.0029 ( 0.0%) 0.0029 ( 0.0%) Safe Stack instrumentation pass 0.0026 ( 0.0%) 0.0002 ( 0.0%) 0.0028 ( 0.0%) 0.0028 ( 0.0%) Lazy Block Frequency Analysis #12 0.0026 ( 0.0%) 0.0001 ( 0.0%) 0.0028 ( 0.0%) 0.0028 ( 0.0%) Optimization Remark Emitter #16 0.0026 ( 0.0%) 0.0001 ( 0.0%) 0.0028 ( 0.0%) 0.0028 ( 0.0%) Lazy Block Frequency Analysis #16 0.0025 ( 0.0%) 0.0002 ( 0.0%) 0.0027 ( 0.0%) 0.0028 ( 0.0%) Lazy Block Frequency Analysis #10 0.0026 ( 0.0%) 0.0002 ( 0.0%) 0.0027 ( 0.0%) 0.0027 ( 0.0%) Lazy Block Frequency Analysis #9 0.0026 ( 0.0%) 0.0001 ( 0.0%) 0.0026 ( 0.0%) 0.0027 ( 0.0%) Shadow Stack GC Lowering 0.0025 ( 0.0%) 0.0002 ( 0.0%) 0.0027 ( 0.0%) 0.0027 ( 0.0%) Lazy Block Frequency Analysis #11 0.0026 ( 0.0%) 0.0001 ( 0.0%) 0.0027 ( 0.0%) 0.0027 ( 0.0%) Lower Garbage Collection Instructions 0.0024 ( 0.0%) 0.0001 ( 0.0%) 0.0026 ( 0.0%) 0.0026 ( 0.0%) LCSSA Verifier #6 0.0024 ( 0.0%) 0.0001 ( 0.0%) 0.0025 ( 0.0%) 0.0026 ( 0.0%) LCSSA Verifier #5 0.0023 ( 0.0%) 0.0002 ( 0.0%) 0.0025 ( 0.0%) 0.0025 ( 0.0%) ObjC ARC contraction 0.0023 ( 0.0%) 0.0002 ( 0.0%) 0.0025 ( 0.0%) 0.0025 ( 0.0%) LCSSA Verifier #4 0.0023 ( 0.0%) 0.0001 ( 0.0%) 0.0024 ( 0.0%) 0.0024 ( 0.0%) LCSSA Verifier #7 0.0016 ( 0.0%) 0.0000 ( 0.0%) 0.0016 ( 0.0%) 0.0016 ( 0.0%) Infer set function attributes 0.0012 ( 0.0%) 0.0000 ( 0.0%) 0.0012 ( 0.0%) 0.0012 ( 0.0%) Assumption Cache Tracker 0.0011 ( 0.0%) 0.0000 ( 0.0%) 0.0011 ( 0.0%) 0.0011 ( 0.0%) Merge Duplicate Global Constants 0.0005 ( 0.0%) 0.0000 ( 0.0%) 0.0005 ( 0.0%) 0.0005 ( 0.0%) Eliminate Available Externally Globals 0.0005 ( 0.0%) 0.0000 ( 0.0%) 0.0005 ( 0.0%) 0.0005 ( 0.0%) Assumption Cache Tracker #2 0.0005 ( 0.0%) 0.0000 ( 0.0%) 0.0005 ( 0.0%) 0.0005 ( 0.0%) Strip Unused Function Prototypes 0.0005 ( 0.0%) 0.0000 ( 0.0%) 0.0005 ( 0.0%) 0.0005 ( 0.0%) Pre-ISel Intrinsic Lowering 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Deduce and propagate attributes 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Force set function attributes 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Profile summary info #2 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) A No-Op Barrier Pass 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Rewrite Symbols 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Create Garbage Collector Module Metadata 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Profile summary info 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Target Library Information 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Scoped NoAlias Alias Analysis 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Target Transform Information 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Type-Based Alias Analysis 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Type-Based Alias Analysis #2 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Target Library Information #2 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Target Pass Configuration 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Machine Module Information 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Machine Branch Probability Analysis 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Scoped NoAlias Alias Analysis #2 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) 0.0000 ( 0.0%) Target Transform Information #2 100.7337 (100.0%) 4.6138 (100.0%) 105.3474 (100.0%) 105.3627 (100.0%) Total

===-------------------------------------------------------------------------=== Clang front-end time report ===-------------------------------------------------------------------------=== Total Execution Time: 114.9768 seconds (115.1020 wall clock)