Closed paulmenzel closed 2 months ago
it could be an issue with the perf_event driver with your CPU. Could you please try this:
su
export PCM_NO_PERF=1
pcm -r
$ sudo PCM_NO_PERF=1 ./bin/pcm -r
Intel(r) Performance Counter Monitor ($Format:%ci ID=%h$)
===== Processor information =====
Linux arch_perfmon flag : yes
Hybrid processor : no
IBRS and IBPB supported : yes
STIBP supported : yes
Spec arch caps supported : yes
Max CPUID level : 22
CPU model number : 142
Number of physical cores: 2
Number of logical cores: 4
Number of online logical cores: 4
Threads (logical cores) per physical core: 2
Num sockets: 1
Physical cores per socket: 2
Last level cache slices per socket: 2
Core PMU (perfmon) version: 4
Number of core PMU generic (programmable) counters: 3
Width of generic (programmable) counters: 48 bits
Number of core PMU fixed counters: 3
Width of fixed counters: 48 bits
Nominal core frequency: 2900000000 Hz
IBRS enabled in the kernel : no
STIBP enabled in the kernel : no
The processor is not susceptible to Rogue Data Cache Load: no
The processor supports enhanced IBRS : no
Package thermal spec power: 15 Watt; Package minimum power: 0 Watt; Package maximum power: 0 Watt;
INFO: Linux perf interface to program uncore PMUs is present
Socket 0: 0 PCU units detected. 0 IIO units detected. 0 IRP units detected. 0 CHA/CBO units detected. 0 MDF units detected. 0 UBOX units detected. 0 CXL units detected. 0 PCIE_GEN5x16 units detected. 0 PCIE_GEN5x8 units detected.
Resetting PMU configuration
Zeroed PMU registers
Disabling NMI watchdog since it consumes one hw-PMU counter. To keep NMI watchdog set environment variable PCM_KEEP_NMI_WATCHDOG=1 (this reduces the core metrics set)
Closed perf event handles
Trying to use Linux perf events...
Usage of Linux perf events is disabled through PCM_NO_PERF environment variable. Using direct PMU programming...
WARNING: Custom counter 0 is in use. MSR_PERF_GLOBAL_INUSE on core 0: 0x8000000000000009
WARNING: Core 0 IA32_PERFEVTSEL0_ADDR is not zeroed 1245244
WARNING: Custom counter 0 is in use. MSR_PERF_GLOBAL_INUSE on core 1: 0x8000000000000009
WARNING: Core 1 IA32_PERFEVTSEL0_ADDR is not zeroed 1245244
WARNING: Custom counter 0 is in use. MSR_PERF_GLOBAL_INUSE on core 2: 0x8000000000000009
WARNING: Core 2 IA32_PERFEVTSEL0_ADDR is not zeroed 1245244
WARNING: Custom counter 0 is in use. MSR_PERF_GLOBAL_INUSE on core 3: 0x8000000000000009
WARNING: Core 3 IA32_PERFEVTSEL0_ADDR is not zeroed 1245244
Detected Intel(R) Core(TM) i7-7500U CPU @ 2.70GHz "Intel(r) microarchitecture codename Kabylake/Whiskey Lake" stepping 9 microcode level 0xf4
UTIL : utlization (same as core C0 state active state residency, the value is in 0..1)
IPC : instructions per CPU cycle
CFREQ : core frequency in Ghz
L3MISS: L3 (read) cache misses
L3HIT : L3 (read) cache hit ratio (0.00-1.00)
L3MPI : number of L3 (read) cache misses per instruction
L2MPI : number of L2 (read) cache misses per instruction
READ : bytes read from main memory controller (in GBytes)
WRITE : bytes written to main memory controller (in GBytes)
IO : bytes read/written due to IO requests to memory controller (in GBytes); this may be an over estimate due to same-cache-line partial requests
IA : bytes read/written due to IA requests to memory controller (in GBytes); this may be an over estimate due to same-cache-line partial requests
GT : bytes read/written due to GT requests to memory controller (in GBytes); this may be an over estimate due to same-cache-line partial requests
TEMP : Temperature reading in 1 degree Celsius relative to the TjMax temperature (thermal headroom): 0 corresponds to the max temperature
energy: Energy in Joules
Core (SKT) | UTIL | IPC | CFREQ | L3MISS | L2MISS | L3HIT | L3MPI | L2MPI | TEMP
0 0 0.03 0.69 1.54 33 K 59 K 0.41 0.0012 0.0022 54
1 0 0.01 0.30 0.98 10 K 17 K 0.38 0.0047 0.0082 55
2 0 0.01 0.50 1.11 25 K 40 K 0.35 0.0034 0.0053 54
3 0 0.02 0.70 1.18 31 K 57 K 0.44 0.0019 0.0036 55
---------------------------------------------------------------------------------------------------------------
SKT 0 0.02 0.62 1.28 101 K 175 K 0.41 0.0019 0.0033 51
---------------------------------------------------------------------------------------------------------------
TOTAL * 0.02 0.62 1.28 101 K 175 K 0.41 0.0019 0.0033 N/A
Instructions retired: 53 M ; Active cycles: 85 M ; Time (TSC): 2909 Mticks ; C0 (active,non-halted) core residency: 1.66 %
C1 core residency: 2.57 %; C3 core residency: 0.05 %; C6 core residency: 1.06 %; C7 core residency: 94.66 %;
C0 package residency: 8.46 %; C2 package residency: 23.43 %; C3 package residency: 1.93 %; C6 package residency: 2.47 %; C7 package residency: 0.02 %; C8 package residency: 63.69 %; C9 package residency: 0.00 %; C10 package residency: 0.00 %;
┌────────────────────────────────────────────────────────────────────────────────┐
Core C-state distribution│01167777777777777777777777777777777777777777777777777777777777777777777777777777│
└────────────────────────────────────────────────────────────────────────────────┘
┌─────────────────────────────────────────────────────────────────────────────────┐
Package C-state distribution│000000022222222222222222223366888888888888888888888888888888888888888888888888888│
└─────────────────────────────────────────────────────────────────────────────────┘
---------------------------------------------------------------------------------------------------------------
MEM (GB)->| READ | WRITE | IO | IA | GT | CPU energy | PP0 energy | PP1 energy |
---------------------------------------------------------------------------------------------------------------
SKT 0 0.46 0.23 0.25 0.23 0.20 1.00 0.10 0.15
---------------------------------------------------------------------------------------------------------------
[…]
this seems to be resolved with "export PCM_NO_PERF=1". Closing
On the Intel Kaby Lake laptop Dell XPS 13 9360,
-nan
is shown in the output: