intel / xpumanager

MIT License
87 stars 18 forks source link

xpu-smi returns no or unexpected info #69

Closed js333031 closed 8 months ago

js333031 commented 9 months ago
  1. Looks like temperature is missing and temperature/freq columns are reversed
(base) ats@localhost:~> xpu-smi dump -d 0 -m 0,1,2,3
Timestamp, DeviceId, GPU Utilization (%), GPU Power (W), GPU Frequency (MHz), GPU Core Temperature (Celsius Degree)
18:38:50.000,    0,     , 43.72, 2050,
18:38:51.000,    0,     , 43.74, 2050,
18:38:52.000,    0,     , 43.72, 2050,
18:38:53.000,    0,     , 43.72, 2050,
18:38:54.000,    0,     , 43.72, 2050,
18:38:55.000,    0,     , 43.70, 2050,
  1. xpu-smi returns PCIe Generation and Max Link width as -1

System is openSUSE 15.5 Leap.

(base) ats@localhost:~> xpu-smi discovery -d 0
+-----------+--------------------------------------------------------------------------------------+
| Device ID | Device Information                                                                   |
+-----------+--------------------------------------------------------------------------------------+
| 0         | Device Type: GPU                                                                     |
|           | Device Name: Intel(R) Data Center GPU Flex 170                                       |
|           | PCI Device ID: 0x56c0                                                                |
|           | Vendor Name: Intel(R) Corporation                                                    |
|           | UUID: 00000000-0000-0000-b58c-cba33270b5fc                                           |
|           | Core Clock Rate: 2050 MHz                                                            |
|           | Stepping: C0                                                                         |
|           | SKU Type:                                                                            |
|           |                                                                                      |
|           | Driver Version: I915_23.6.37_PSB_230425.49                                           |
|           | Kernel Version: 5.14.21-150500.55.36-default                                         |
|           | GFX Firmware Name: GFX                                                               |
|           | GFX Firmware Version: unknown                                                        |
|           | GFX Firmware Status: normal                                                          |
|           |                                                                                      |
|           | PCI BDF Address: 0000:3a:00.0                                                        |
|           | PCI Slot:                                                                            |
|           | PCIe Generation: -1                                                                  |
|           | PCIe Max Link Width: -1                                                              |
|           |                                                                                      |
|           | Memory Physical Size: 16288.00 MiB                                                   |
|           | Max Mem Alloc Size: 4095.99 MiB                                                      |
|           | ECC State:                                                                           |
|           | Number of Memory Channels: 2                                                         |
|           | Memory Bus Width: 128                                                                |
|           | Max Hardware Contexts: 65536                                                         |
|           | Max Command Queue Priority: 0                                                        |
|           |                                                                                      |
|           | Number of EUs: 512                                                                   |
|           | Number of Tiles: 1                                                                   |
|           | Number of Slices: 1                                                                  |
|           | Number of Sub Slices per Slice: 32                                                   |
|           | Number of Threads per EU: 8                                                          |
|           | Physical EU SIMD Width: 8                                                            |
|           | Number of Media Engines: 0                                                           |
|           | Number of Media Enhancement Engines: 0                                               |
|           |                                                                                      |
|           | Xe Link Calibration Date: Not Calibrated                                             |
+-----------+--------------------------------------------------------------------------------------+
(base) ats@localhost:~>

System:

CPU: Intel(R) Xeon(R) Platinum 8480+
(base) ats@localhost:~> cat /etc/os-release
NAME="openSUSE Leap"
VERSION="15.5"
ID="opensuse-leap"
ID_LIKE="suse opensuse"
VERSION_ID="15.5"
PRETTY_NAME="openSUSE Leap 15.5"
ANSI_COLOR="0;32"
CPE_NAME="cpe:/o:opensuse:leap:15.5"
BUG_REPORT_URL="https://bugs.opensuse.org"
HOME_URL="https://www.opensuse.org/"
DOCUMENTATION_URL="https://en.opensuse.org/Portal:Leap"
LOGO="distributor-logo-Leap"
(base) ats@localhost:~>
testdig commented 9 months ago

@js333031 could you try to add sudo before running the commands?

testdig commented 9 months ago

@js333031, did you try it? We may close this issue if no update from you by the end of this week.

js333031 commented 9 months ago

Yes please