GPUOpen-LibrariesAndSDKs / MxGPU-Virtualization

MIT License
182 stars 84 forks source link

CentOS 7 as hostos: gim error:(set_new_adapter:617) asic does not support SRIOV #10

Open zhangwenjian opened 5 years ago

zhangwenjian commented 5 years ago

Hello!

I'm trying to use an AMD FirePro S7150 in a CentOS 7.3 system. Here is some information regarding the setup:

[root@host001 /]# uname -r 3.10.0-514.el7.x86_64

[root@host001 /]# dmesg | grep gim [ 10.859725] gim: loading out-of-tree module taints kernel. [ 10.860008] gim: module verification failed: signature and/or required key missing - tainting kernel [ 10.861041] gim info:(gim_init:144) Start AMD open source GIM initialization [ 10.861045] gim info:(gim_init:147) GPU IOV MODULE - version 1.0.0-k [ 10.861047] gim info:(gim_init:149) Copyright (c) 2014-2017 AMD Corporation. [ 10.861069] gim info:(init_config:305) INIT CONFIG [ 10.861091] gim info:(set_new_adapter:614) curr allocated at ffffffffa0326a40 [ 10.861092] gim info:(set_new_adapter:621) SRIOV is supported [ 10.861098] gim info:(set_new_adapter:629) found PCI bridge device [ 10.861100] gim info:(set_new_adapter:633) found: 00:2.0 [ 10.861122] gim info:(set_new_adapter:650) mmio_base = ffffc90007180000 [ 10.861130] gim info:(set_new_adapter:652) doorbell = ffffc90007c00000 [ 10.861147] gim info:(set_new_adapter:654) pf.fb_va = ffffc90040000000 [ 10.861159] gim info:(sriov_is_ari_enabled:164) PCI_SRIOV_CAP = 0x00000002 [ 10.861162] gim info:(sriov_is_ari_enabled:173) PCI_SRIOV_CTRL = 0x00000010 [ 10.861163] gim info:(sriov_is_ari_enabled:176) PCI_SRIOV_CTRL_ARI is set --> ARI is supported [ 10.861167] gim info:(program_ari_mode:483) Read bif_strap8 = 0x00200004 [ 10.861169] gim info:(program_ari_mode:488) program_ari_mode - Set ARI_Mode = PF_BUS [ 10.861171] gim info:(program_ari_mode:498) Write bif_strap8 = 0x00000004 [ 10.861173] gim info:(gim_read_rom_from_reg:222) Reading VBios from ROM [ 10.861290] gim info:(gim_read_vbios:284) VBIOS starts: 0x55, 0xaa [ 10.861292] gim info:(gim_read_vbios:287) VBios size is 0x10000 [ 10.861299] gim info:(gim_read_vbios:290) vbios allocated at ffffc90006d7d000 [ 10.861301] gim info:(gim_read_rom_from_reg:222) Reading VBios from ROM [ 10.994958] gim info:(gim_read_vbios:298) BIOS Version Major 0xF Minor 0x31 [ 10.994999] gim info:(gim_read_vbios:311) Valid video BIOS image, [ 10.995000] gim info:(gim_read_vbios:313) size = 0x10000, check sum is 0x53b100 [ 10.995011] gim info:(gim_post_vbios:344) Init Parser passed!, continue [ 10.995015] gim info:(atom_chk_asic_status:335) ATOM_CheckAsicStatus - BIOS_SCRATCH_7 = 0x00000000 [ 10.995016] gim info:(atom_chk_asic_status:338) Isolate ATOM_S7_ASIC_INIT_COMPLETE_MASK bit(s) = 0x00000000 [ 10.995018] gim info:(atom_chk_asic_status:341) RLC_CNTL = 0x00000000 [ 10.995019] gim info:(atom_chk_asic_status:343) Isolate RLC_CNTL__RLC_ENABLE_F32_MASK = 0x00000000 [ 10.995020] gim info:(atom_chk_asic_status:350) ATOM_ASIC_NEED_POST [ 10.995021] gim info:(gim_post_vbios:347) Asic needs a VBios post [ 10.995022] gim info:(atom_post_vbios:200) ATOM_PostVBIOS: firmware_info passed [ 10.995023] gim info:(atom_post_vbios:253) asic_init before, engine clock = 7530; memory clock =1e848 [ 11.320309] gim info:(atom_post_vbios:256) asic_init after [ 11.320312] gim info:(atom_post_vbios:263) atom_init_fan_cntl before [ 11.320313] gim info:(atom_post_vbios:265) atom_init_fan_cntl after [ 11.320314] gim info:(gim_post_vbios:353) Post INIT_ASIC successfully! [ 11.320327] gim info:(firmware_requires_update:512) SMU option ROM version 0x111700 [ 11.320328] gim info:(firmware_requires_update:513) versus patch version 0x111a00 [ 11.320339] gim info:(firmware_requires_update:523) RLCV option ROM version 113 versus patch version 113 [ 11.320340] gim info:(firmware_requires_update:528) TOC found, update it [ 11.320341] gim info:(patch_firmware:579) Update smc_init table [ 11.882360] gim info:(patch_firmware:612) Update smu firmware [ 11.892326] gim info:(patch_firmware:630) Update TOC [ 11.896239] gim info:(gim_post_vbios:359) Asic needs firmware loaded [ 11.896241] gim info:(atom_post_vbios:200) ATOM_PostVBIOS: firmware_info passed [ 11.896242] gim info:(atom_post_vbios:239) just load uCode [ 11.896243] gim info:(atom_post_vbios:253) asic_init before, engine clock = 7530; memory clock =1e848 [ 13.172548] gim info:(atom_post_vbios:256) asic_init after [ 13.172551] gim info:(atom_post_vbios:263) atom_init_fan_cntl before [ 13.172552] gim info:(atom_post_vbios:265) atom_init_fan_cntl after [ 13.172553] gim info:(gim_post_vbios:365) Post LOAD_FW successfully! [ 13.172554] gim info:(gim_post_vbios:366) Post VBIOS successfully! [ 13.172555] gim info:(set_new_adapter:701) gim_post_vbios done [ 13.172557] gim info:(set_new_adapter:739) Scheduler Time interval is per-vf from XL [ 13.172557] gim info:(set_new_adapter:740) config file [ 13.172559] gim info:(enable_sriov:295) Enable SRIOV [ 13.172560] gim info:(enable_sriov:296) Enable SRIOV vfs count = 16 [ 13.277685] gim info:(enumerate_vfs:123) vf found: 04:2.0 [ 13.277702] gim info:(enumerate_vfs:123) vf found: 04:2.1 [ 13.277727] gim info:(enumerate_vfs:123) vf found: 04:2.2 [ 13.277740] gim info:(enumerate_vfs:123) vf found: 04:2.3 [ 13.277753] gim info:(enumerate_vfs:123) vf found: 04:2.4 [ 13.277766] gim info:(enumerate_vfs:123) vf found: 04:2.5 [ 13.277779] gim info:(enumerate_vfs:123) vf found: 04:2.6 [ 13.277792] gim info:(enumerate_vfs:123) vf found: 04:2.7 [ 13.277805] gim info:(enumerate_vfs:123) vf found: 04:3.0 [ 13.277817] gim info:(enumerate_vfs:123) vf found: 04:3.1 [ 13.277830] gim info:(enumerate_vfs:123) vf found: 04:3.2 [ 13.277843] gim info:(enumerate_vfs:123) vf found: 04:3.3 [ 13.277856] gim info:(enumerate_vfs:123) vf found: 04:3.4 [ 13.277869] gim info:(enumerate_vfs:123) vf found: 04:3.5 [ 13.277882] gim info:(enumerate_vfs:123) vf found: 04:3.6 [ 13.277895] gim info:(enumerate_vfs:123) vf found: 04:3.7 [ 13.277928] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.0 [ 13.277930] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.277934] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.277953] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.1 [ 13.277955] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.277958] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.277979] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.2 [ 13.277980] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.277984] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278004] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.3 [ 13.278005] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278008] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278028] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.4 [ 13.278030] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278033] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278053] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.5 [ 13.278055] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278058] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278078] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.6 [ 13.278079] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278083] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278102] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:2.7 [ 13.278104] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278107] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278127] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.0 [ 13.278128] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278131] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278151] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.1 [ 13.278153] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278156] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278176] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.2 [ 13.278178] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278181] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278206] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.3 [ 13.278208] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278211] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278231] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.4 [ 13.278233] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278236] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278256] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.5 [ 13.278257] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278261] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278280] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.6 [ 13.278282] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278285] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278305] gim info:(pci_disable_error_reporting:764) Disable error reporting for device: 04:3.7 [ 13.278307] gim info:(pci_disable_error_reporting:769) Mask before -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278311] gim info:(pci_disable_error_reporting:780) Mask after -> corr = 0x00000000, uncorr = 0x00000000 [ 13.278320] gim info:(pci_gpu_iov_init:87) total_fb_available = 8190 [ 13.278321] gim info:(pci_gpu_iov_init:88) AMD GIM pci_gpu_iov_init pos = 400 [ 13.278322] gim info:(pci_gpu_iov_init:90) AMD GIM pci_gpu_iov_init total_fb_available = 1ffe [ 13.278323] gim info:(init_frame_buffer_partition:190) PCI defined PF FB size = 256 MB [ 13.278324] gim info:(init_frame_buffer_partition:194) PCI defined VF FB size = 256 MB [ 13.278325] gim info:(init_frame_buffer_partition:200) Total FB Available = 8190 MB, CSA = 8 MB, Max remaining FBsize =8160 MB [ 13.278327] gim info:(init_frame_buffer_partition:201) max_fb_size = 8160 [ 13.278328] gim info:(init_frame_buffer_partition:214) PF FB size after checking limits from config file = 256MB [ 13.278329] gim info:(init_frame_buffer_partition:217) PF rounded down to nearest 16MB boundary = 256 [ 13.278330] gim info:(init_pf_fb:60) total framebuffer available = 1ffe [ 13.278331] gim info:(init_pf_fb:61) pf framebuffer = 100 [ 13.278332] gim info:(init_pf_fb:63) total framebuffer consumed = 1efe [ 13.278333] gim info:(init_frame_buffer_partition:224) CSA starts at offset 256MB [ 13.278334] gim info:(init_context_save_area:42) AMD GIM init_context_save_area: base =100 size=1. [ 13.278336] gim info:(init_frame_buffer_partition:231) VF FB base = 272MB (256 + 16) [ 13.278337] gim info:(init_frame_buffer_partition:235) VF FB Size = 7904MB (8160 - 256) [ 13.278339] gim info:(init_fb_static:118) AMD GIM init_fb_static: num_vf = 16, base= 272, total_size=7904, mini_size=256 [ 13.278340] gim info:(init_fb_static:139) AMD GIM init_fb_static: vf_fb_size = 480, base= 272 [ 13.278341] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 0 base =272,size= 480 [ 13.278343] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 1 base =752,size= 480 [ 13.278344] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 2 base =1232,size= 480 [ 13.278345] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 3 base =1712,size= 480 [ 13.278347] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 4 base =2192,size= 480 [ 13.278348] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 5 base =2672,size= 480 [ 13.278349] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 6 base =3152,size= 480 [ 13.278350] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 7 base =3632,size= 480 [ 13.278352] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 8 base =4112,size= 480 [ 13.278353] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 9 base =4592,size= 480 [ 13.278354] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 10 base =5072,size= 480 [ 13.278356] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 11 base =5552,size= 480 [ 13.278357] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 12 base =6032,size= 480 [ 13.278358] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 13 base =6512,size= 480 [ 13.278360] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 14 base =6992,size= 480 [ 13.278361] gim info:(init_fb_static:150) AMD GIM init_fb_static: partition 15 base =7472,size= 480 [ 13.278365] gim info:(set_new_adapter:789) enable MSI [ 13.278367] gim error:(set_new_adapter:791) fail to enable MSI [ 13.278368] gim info:(gim_probe:86) AMD GIM probe: pf_count = 1