[Perf] Linux/arm64: 15 Regressions on 5/4/2024 5:29:12 AM

performanceautofiler[bot] commented 5 months ago

Run Information

Name	Value
Architecture	arm64
OS	ubuntu 22.04
Queue	AmpereUbuntu
Baseline	e965312582a33c0acf2020648b54a152a80c139a
Compare	5962fd511e3eacf7fe91520392c041e94e5d31cc
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector
[FusedMultiplyAdd_ScalarAddend - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).FusedMultiplyAdd_ScalarAddend(BufferLength%3a%20128).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	53.07 ns	150.70 ns	2.84	0.04	True
[FusedMultiplyAdd_Vectors - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).FusedMultiplyAdd_Vectors(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	1.62 μs	3.21 μs	1.99	0.03	True
[Truncate - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Truncate(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	658.28 ns	2.85 μs	4.33	0.02	True
[Pow_ScalarBase - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Pow_ScalarBase(BufferLength%3a%20128).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	2.08 μs	2.56 μs	1.23	0.01	True
[Pow_Vectors - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Pow_Vectors(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	48.40 μs	60.44 μs	1.25	0.01	True
[Pow_ScalarBase - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Pow_ScalarBase(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	48.94 μs	61.04 μs	1.25	0.01	True
[FusedMultiplyAdd_ScalarMultiplier - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).FusedMultiplyAdd_ScalarMultiplier(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	872.01 ns	3.18 μs	3.65	0.04	True
[FusedMultiplyAdd_Vectors - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).FusedMultiplyAdd_Vectors(BufferLength%3a%20128).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	68.76 ns	140.01 ns	2.04	0.02	True
[Pow_ScalarExponent - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Pow_ScalarExponent(BufferLength%3a%20128).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	2.05 μs	2.51 μs	1.22	0.01	True
[FusedMultiplyAdd_ScalarAddend - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).FusedMultiplyAdd_ScalarAddend(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	817.70 ns	3.20 μs	3.91	0.14	True
[Pow_Vectors - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Pow_Vectors(BufferLength%3a%20128).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	2.07 μs	2.52 μs	1.22	0.01	True
[Pow_ScalarExponent - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).Pow_ScalarExponent(BufferLength%3a%203079).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	48.04 μs	60.23 μs	1.25	0.01	True
[FusedMultiplyAdd_ScalarMultiplier - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives(Double).FusedMultiplyAdd_ScalarMultiplier(BufferLength%3a%20128).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	54.65 ns	151.26 ns	2.77	0.03	True

graph graph graph graph graph graph graph graph graph graph graph graph graph Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives&lt;Double&gt;*'

### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.FusedMultiplyAdd_ScalarAddend(BufferLength: 128) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.FusedMultiplyAdd_Vectors(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Truncate(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Pow_ScalarBase(BufferLength: 128) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Pow_Vectors(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Pow_ScalarBase(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.FusedMultiplyAdd_ScalarMultiplier(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.FusedMultiplyAdd_Vectors(BufferLength: 128) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Pow_ScalarExponent(BufferLength: 128) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.FusedMultiplyAdd_ScalarAddend(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Pow_Vectors(BufferLength: 128) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.Pow_ScalarExponent(BufferLength: 3079) #### ETL Files #### Histogram #### JIT Disasms ### System.Numerics.Tensors.Tests.Perf_FloatingPointTensorPrimitives<Double>.FusedMultiplyAdd_ScalarMultiplier(BufferLength: 128) #### ETL Files #### Histogram #### JIT Disasms ### Docs [Profiling workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/profiling-workflow-dotnet-runtime.md) [Benchmarking workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/benchmarking-workflow-dotnet-runtime.md)

Run Information

Name	Value
Architecture	arm64
OS	ubuntu 22.04
Queue	AmpereUbuntu
Baseline	e965312582a33c0acf2020648b54a152a80c139a
Compare	5962fd511e3eacf7fe91520392c041e94e5d31cc
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Memory.Span<Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
[SequenceEqual - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Memory.Span(Int32).SequenceEqual(Size%3a%204).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	3.46 ns	5.32 ns	1.54	0.24	False

graph Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Memory.Span&lt;Int32&gt;*'

### System.Memory.Span<Int32>.SequenceEqual(Size: 4) #### ETL Files #### Histogram #### JIT Disasms ### Docs [Profiling workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/profiling-workflow-dotnet-runtime.md) [Benchmarking workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/benchmarking-workflow-dotnet-runtime.md)

Run Information

Name	Value
Architecture	arm64
OS	ubuntu 22.04
Queue	AmpereUbuntu
Baseline	e965312582a33c0acf2020648b54a152a80c139a
Compare	5962fd511e3eacf7fe91520392c041e94e5d31cc
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.Collections.TryGetValueFalse<Int32, Int32>

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio
[Dictionary - Duration of single invocation](<https://pvscmdupload.z22.web.core.windows.net/reports/allTestHistory/refs/heads/main_arm64_ubuntu 22.04/System.Collections.TryGetValueFalse(Int32%2c%20Int32).Dictionary(Size%3a%20512).html>) 📝 - Benchmark Source ADX - Test Multi Config Graph	4.13 μs	5.96 μs	1.44	0.08	False

graph Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

git clone https://github.com/dotnet/performance.git
python3 .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.Collections.TryGetValueFalse&lt;Int32, Int32&gt;*'

### System.Collections.TryGetValueFalse<Int32, Int32>.Dictionary(Size: 512) #### ETL Files #### Histogram #### JIT Disasms ### Docs [Profiling workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/profiling-workflow-dotnet-runtime.md) [Benchmarking workflow for dotnet/runtime repository](https://github.com/dotnet/performance/blob/master/docs/benchmarking-workflow-dotnet-runtime.md)