rr-debugger / rr

Record and Replay Framework
http://rr-project.org/
Other
9.2k stars 585 forks source link

AMD Assertion: 'Assertion `!counting_period || interrupt_val <= adjusted_counting_period' failed to hold' #2834

Open tomrittervg opened 3 years ago

tomrittervg commented 3 years ago

This is related to #2720 but is triggering frequently on playback; not just recording, on AMD CPUs. (Specifically threadrippers.)

Bumping the skid from 10000 to 30000 worked for Nika; but made it extremely slow.

The issue is (I'm told) that on these CPUs the PMU interrupt is fired with a variable (and often too high) latency; causing us to overshoot the bounds.

tomrittervg commented 3 years ago

Fiddling with kernel.perf_event_max_sample_rate as described in #2720 might be a solution, but personally I never saw those error messages in my failures.