Open GoogleCodeExporter opened 9 years ago
Hi, the event ID and umask I use for these events are according to the
documentation. In the documentation it says that FUB is CRI :-). No idea what
this means, they do not introduce those terms. The only way to say who is right
is to compare against a microbenchmark where you know the result. I plan to do
this for Phi also. I find it suspicious that the vtune results are all flat to
the fifth digit. Are those end to end measurements?
Original comment by jan.trei...@gmail.com
on 12 Feb 2014 at 3:35
I am running the same program pinned on different core on xeon phi and
measuring the same event. and the values are different in different
cores. please checkout the result of multiple runs.
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 58 -O /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
Event,core 58
L2_READ_HIT_M,10761.000000
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 58 -O /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
Event,core 58
L2_READ_HIT_M,11010.000000
~/perf_anal $ ./work_1.sh
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 40 -O /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
Event,core 40
L2_READ_HIT_M,0.000000
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 10 -O /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
Event,core 10
L2_READ_HIT_M,10768.000000
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 40 -O /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
Event,core 40
L2_READ_HIT_M,0.000000
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 54 -O -m /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
=====================
Region: Compute
=====================
Region Info,core 54
RDTSC Runtime [s],0.021591
call count,1.000000
Event,core 54
L2_READ_HIT_M,158.000000
~/perf_anal $ /home/snataraj/perf_anal/likwid/likwid-perfctr -g
L2_READ_HIT_M:PMC0 -C 4 -O /home/snataraj/perf_anal/copy
-------------------------------------------------------------
-------------------------------------------------------------
CPU type: Intel Xeon Phi Coprocessor
CPU clock: 1.05 GHz
-------------------------------------------------------------
/home/snataraj/perf_anal/copy
K=1048577
Status: 0x0
Event,core 4
L2_READ_HIT_M,524291.000000
Original comment by sur...@gmail.com
on 13 Feb 2014 at 6:10
Reply from intel forum:
FUB must stand for something like "Functional Unit Block" because P54C
refers to the processor core (P54C is a specific version of the Pentium
core, though the actual core in the Xeon Phi has been heavily upgraded
from the original P54C), CRI refers to the "Cache-Ring-Interface", and
VPU refers to the "Vector-Processing-Unit". In the Xeon Phi performance
counters, the UMASK field actually specifies the functional unit for
which the event is requested, with 0x00 referring to the core (P54C),
0x10 referring to the CRI, and 0x20 referring to the VPU. This was
confusing to me at first because because on other processors the UMASK
is almost always used to modify the specific details of what is measured
by an Event Select code, rather than actually specifying the Unit on
which the measurements should occur. On Xeon Phi the UMASK field is
used in a way that is much closer to what you would expect a "Unit Mask"
to mean -- it specifies the "Unit" for which you want the measurements
to be taken.
Original comment by sur...@gmail.com
on 14 Feb 2014 at 10:02
Original issue reported on code.google.com by
sur...@gmail.com
on 6 Feb 2014 at 4:20