Open EgorBo opened 1 year ago
I couldn't figure out the best area label to add to this issue. If you have write-permissions please help me learn by adding exactly one area label.
Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch, @kunalspathak
See info in area-owners.md if you want to be subscribed.
| Author: | EgorBo |
|---|---|
| Assignees: | EgorBo |
| Labels: | `area-CodeGen-coreclr`, `untriaged` |
| Milestone: | - |
Fun fact: `DOTNET_TC_CallCountingDelayMs=1` makes `TC=1` (default) faster than `TC=0`. So apparently there is huge contention when installing call counting stubs.
cc @noahfalk @kouvel
The regression is seen after enabling dynamic PGO in https://github.com/dotnet/runtime/pull/86225
@davidwrighton @mangod9
As mentioned offline, we also ought to start measuring with the NAOT'd crossgen2.
I was measuring `crossgen2.exe -O SPC.dll` (actually, the exact command we use for `build Clr.NativeCoreLib -c Release`) and noticed a few problems:

Legend:
- `TC` - `DOTNET_TieredCompilation` (`1` by default)
- `PGO` - `DOTNET_TieredPGO` (`0` by default)
- `CCDelayMS` - `DOTNET_TC_CallCountThreshold` (`100` by default)

The difference is quite noticeable, so it's worth investigating; the numbers are quite stable across multiple runs. Judging by the effect from `DOTNET_TC_CallCountThreshold`, we're having some contention for call counting stub installation/promotion to tier1.
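
For anyone who wants to reproduce this kind of comparison, here is a rough sketch of how the runs could be scripted. It is not the harness used for the numbers above. The `CONFIGS` matrix, the helper names (`build_env`, `time_command`, `compare`) and the choice of reporting the best of N runs are all assumptions made for illustration; only the `DOTNET_*` variable names come from this thread.

```python
import os
import subprocess
import time

# Hypothetical configuration matrix; only the variable names are taken from
# the thread, the particular combinations are illustrative.
CONFIGS = {
    "TC=0": {"DOTNET_TieredCompilation": "0"},
    "TC=1, PGO=0": {"DOTNET_TieredCompilation": "1", "DOTNET_TieredPGO": "0"},
    "TC=1, PGO=1": {"DOTNET_TieredCompilation": "1", "DOTNET_TieredPGO": "1"},
    "TC=1, PGO=1, CallCountingDelayMs=1": {
        "DOTNET_TieredCompilation": "1",
        "DOTNET_TieredPGO": "1",
        "DOTNET_TC_CallCountingDelayMs": "1",
    },
}


def build_env(overrides, base=None):
    """Return a copy of the base environment with the given knobs applied."""
    env = dict(os.environ if base is None else base)
    env.update(overrides)
    return env


def time_command(cmd, overrides, runs=3):
    """Run cmd `runs` times with the overrides set; return the best wall time in seconds."""
    env = build_env(overrides)
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        subprocess.run(cmd, env=env, check=True, stdout=subprocess.DEVNULL)
        timings.append(time.perf_counter() - start)
    return min(timings)


def compare(cmd, configs=CONFIGS, runs=3):
    """Time cmd under every configuration and return {label: seconds}."""
    return {label: time_command(cmd, overrides, runs) for label, overrides in configs.items()}
```

Passing the full crossgen2 command line as a list of arguments to `compare(...)` and printing the returned dictionary gives one wall-clock number per configuration. Taking the minimum is just one way to reduce noise; a median would work as well.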