[Perf] Linux/x64: 3 Regressions on 8/22/2023 10:44:27 PM

performanceautofiler[bot] commented 1 year ago

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	4a5695b6ea0e461e6527de057420872369345282
Compare	c4f8c3b27a8fb2e452a57a6516050c8c2ee677f4
Diff	Diff
Configs	AOT:true, CompilationMode:wasm, RunKind:micro

Regressions in System.Collections.Tests.Perf_BitArray

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
[BitArrayGet - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_x64_ubuntu 22.04_AOT=true_CompilationMode=wasm_RunKind=micro/System.Collections.Tests.Perf_BitArray.BitArrayGet(Size%3a%204).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	66.80 ns	74.12 ns	1.11	0.31	False
[BitArrayGet - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_x64_ubuntu 22.04_AOT=true_CompilationMode=wasm_RunKind=micro/System.Collections.Tests.Perf_BitArray.BitArrayGet(Size%3a%20512).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	8.08 μs	8.98 μs	1.11	0.25	False

graph graph Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.Tests.Perf_BitArray*'

### Payloads [Baseline]() [Compare]() ### System.Collections.Tests.Perf_BitArray.BitArrayGet(Size: 4) #### ETL Files #### Histogram #### Description of detection logic ``` IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small. IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline. IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small. IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small. IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline. IsRegressionWindowed: Marked as regression because 74.12437428306774 > 70.7145489181565. IsChangePoint: Marked as a change because one of 8/22/2023 4:48:44 PM, 8/30/2023 5:40:03 PM falls between 8/22/2023 3:00:25 AM and 8/30/2023 5:40:03 PM. IsRegressionStdDev: Marked as regression because -7.694913060531819 (T) = (0 -75.19581308946532) / Math.Sqrt((5.864234814470246 / (11)) + (6.990967844102838 / (24))) is less than -2.034515297446192 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (11) + (24) - 2, .025) and -0.10243093080470528 = (68.20909227807778 - 75.19581308946532) / 68.20909227807778 is less than -0.05. IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small. IsChangeEdgeDetector: Marked not as a regression because Edge Detector said so. ``` #### JIT Disasms ### System.Collections.Tests.Perf_BitArray.BitArrayGet(Size: 512) #### ETL Files #### Histogram #### Description of detection logic ``` IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small. IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline. IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small. IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small. IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline. IsRegressionWindowed: Marked as regression because 8.97854392658651 > 8.49925886642583. IsChangePoint: Marked as a change because one of 8/22/2023 10:44:27 PM, 8/30/2023 5:40:03 PM falls between 8/22/2023 3:00:25 AM and 8/30/2023 5:40:03 PM. IsRegressionStdDev: Marked as regression because -9.549925340573548 (T) = (0 -9091.340456789718) / Math.Sqrt((63018.16682920582 / (12)) + (96619.94960445144 / (23))) is less than -2.034515297446192 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (12) + (23) - 2, .025) and -0.11374386896636732 = (8162.864649685678 - 9091.340456789718) / 8162.864649685678 is less than -0.05. IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small. IsChangeEdgeDetector: Marked not as a regression because Edge Detector said so. ``` #### JIT Disasms ### Docs [Profiling workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/profiling-workflow-dotnet-runtime.md) [Benchmarking workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/benchmarking-workflow-dotnet-runtime.md)

Run Information

Name	Value
Architecture	x64
OS	ubuntu 22.04
Queue	TigerUbuntu
Baseline	4a5695b6ea0e461e6527de057420872369345282
Compare	c4f8c3b27a8fb2e452a57a6516050c8c2ee677f4
Diff	Diff
Configs	AOT:true, CompilationMode:wasm, RunKind:micro

Regressions in System.Memory.Constructors<Byte>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
[MemoryFromArrayStartLength - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_x64_ubuntu 22.04_AOT=true_CompilationMode=wasm_RunKind=micro/System.Memory.Constructors(Byte).MemoryFromArrayStartLength.html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	9.86 ns	12.67 ns	1.28	0.71	False

graph Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Memory.Constructors&lt;Byte&gt;*'

### Payloads [Baseline]() [Compare]() ### System.Memory.Constructors<Byte>.MemoryFromArrayStartLength #### ETL Files #### Histogram #### Description of detection logic ``` IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small. IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline. IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small. IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small. IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline. IsRegressionWindowed: Marked as regression because 12.671745349502759 > 10.369235546519446. IsChangePoint: Marked as a change because one of 8/22/2023 4:48:44 PM, 8/30/2023 5:40:03 PM falls between 8/22/2023 3:00:25 AM and 8/30/2023 5:40:03 PM. IsRegressionStdDev: Marked as regression because -3.0455686205822126 (T) = (0 -13.79542183423797) / Math.Sqrt((7.726260520053205 / (11)) + (3.821401937402991 / (24))) is less than -2.034515297446192 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (11) + (24) - 2, .025) and -0.2577388233788657 = (10.968431265544554 - 13.79542183423797) / 10.968431265544554 is less than -0.05. IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small. IsChangeEdgeDetector: Marked not as a regression because Edge Detector said so. ``` #### JIT Disasms ### Docs [Profiling workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/profiling-workflow-dotnet-runtime.md) [Benchmarking workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/benchmarking-workflow-dotnet-runtime.md)

radekdoulik commented 1 year ago

@kg looks like https://github.com/dotnet/runtime/pull/90928

kg commented 1 year ago

seems like a measurement problem, unless bdn is running out of table space - but that would regress everything.

kg commented 1 year ago

Looking at the current data these three have stayed slower since then, so something did regress. Did any configuration changes happen around that time? None of what changed in jiterp should affect this stuff unless somehow these tests are running in the interpreter instead of AOT'd code. There's not much historical data so I can't tell whether these got faster when jiterp was turned on originally

dotnet / perf-autofiling-issues