xcat2 / xcat-core

Code repo for xCAT core packages
Eclipse Public License 1.0
367 stars 172 forks source link

[DEV] rinv command get error message on Habanero #2335

Closed xuweibj closed 7 years ago

xuweibj commented 7 years ago
[root@c712ems5-pvt ~]# rinv c712f11n04 model
Error: ipmi plugin bug, pid 46719, process description: 'xcatd SSL: rinv to c712f11n04 for root@localhost: ipmi instance' with error 'Can't call method "id_string" on an undefined value at /opt/xcat/lib/perl/xCAT_plugin/ipmi.pm line 3587.
' while trying to fulfill request for the following nodes: c712f11n04
[root@c712ems5-pvt ~]# rinv c712f11n04 firm
Error: ipmi plugin bug, pid 46732, process description: 'xcatd SSL: rinv to c712f11n04 for root@localhost: ipmi instance' with error 'Can't call method "id_string" on an undefined value at /opt/xcat/lib/perl/xCAT_plugin/ipmi.pm line 3587.
' while trying to fulfill request for the following nodes: c712f11n04 
[root@c712ems5-pvt ~]# rinv c712f11n04 all
c712f11n04: Received incorrect data from BMC for FRU ID: 43
c712f11n04: BMC Firmware: 2.08
c712f11n04: Product ID: 43707
c712f11n04: Manufacturer ID: 0
c712f11n04: Backplane Board Part Number: SP012GMR-IGX
c712f11n04: UUID/GUID: 323F42A0-2900-0384-0010-DEBFD257106B
c712f11n04: Device ID: 32
c712f11n04: Backplane Board Serial Number: CUDD2AF3805M
c712f11n04: Backplane Board Manufacturer: TYAN
c712f11n04: Backplane Board : SYSTEM BACKPLANE
c712f11n04: Backplane Chassis Part Number: 8348-21C
c712f11n04: Backplane Chassis Serial Number: 843F02A
c712f11n04: CPU Board Part Number: 00UM003
c712f11n04: CPU Board Serial Number: YA1932784742
c712f11n04: CPU Board Manufacturer: IBM
c712f11n04: CPU Board : PROCESSOR MODULE
c712f11n04: CPU Board Additional Info: ECID:019A006540203718060A0000008239F2
c712f11n04: DIMM0 Product Serial Number: 10 cd aa 3a (    )
c712f11n04: DIMM0 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM0 Product Version: 34 54 (4T)
c712f11n04: DIMM0 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM1: clear
c712f11n04: DIMM10 Product Serial Number: 10 c2 db d7 (    )
c712f11n04: DIMM10 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM10 Product Version: 34 54 (4T)
c712f11n04: DIMM10 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM11: clear
c712f11n04: DIMM12 Product Serial Number: 10 cd a9 60 (    )
c712f11n04: DIMM12 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM12 Product Version: 34 54 (4T)
c712f11n04: DIMM12 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM13: clear
c712f11n04: DIMM14 Product Serial Number: 10 cd a9 9e (    )
c712f11n04: DIMM14 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM14 Product Version: 34 54 (4T)
c712f11n04: DIMM14 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM15: clear
c712f11n04: DIMM16 Product Serial Number: 10 c2 d6 b8 (    )
c712f11n04: DIMM16 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM16 Product Version: 34 54 (4T)
c712f11n04: DIMM16 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM17: clear
c712f11n04: DIMM18 Product Serial Number: 41 17 b5 ce (A   )
c712f11n04: DIMM18 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM18 Product Version: 34 54 (4T)
c712f11n04: DIMM18 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM19: clear
c712f11n04: DIMM2 Product Serial Number: 41 17 b5 9f (A   )
c712f11n04: DIMM2 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM2 Product Version: 34 54 (4T)
c712f11n04: DIMM2 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM20 Product Serial Number: 41 17 b6 0a (A   )
c712f11n04: DIMM20 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM20 Product Version: 34 54 (4T)
c712f11n04: DIMM20 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM21: clear
c712f11n04: DIMM22 Product Serial Number: 10 c2 d6 c4 (    )
c712f11n04: DIMM22 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM22 Product Version: 34 54 (4T)
c712f11n04: DIMM22 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM23: clear
c712f11n04: DIMM24 Product Serial Number: 10 cd a9 a5 (    )
c712f11n04: DIMM24 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM24 Product Version: 34 54 (4T)
c712f11n04: DIMM24 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM25: clear
c712f11n04: DIMM26 Product Serial Number: 41 17 b6 48 (A  H)
c712f11n04: DIMM26 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM26 Product Version: 34 54 (4T)
c712f11n04: DIMM26 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM27: clear
c712f11n04: DIMM28 Product Serial Number: 10 cd a9 ae (    )
c712f11n04: DIMM28 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM28 Product Version: 34 54 (4T)
c712f11n04: DIMM28 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM29: clear
c712f11n04: DIMM3: clear
c712f11n04: DIMM30 Product Serial Number: 10 cd a9 9c (    )
c712f11n04: DIMM30 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM30 Product Version: 34 54 (4T)
c712f11n04: DIMM30 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM31: clear
c712f11n04: DIMM4 Product Serial Number: 41 17 b5 41 (A  A)
c712f11n04: DIMM4 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM4 Product Version: 34 54 (4T)
c712f11n04: DIMM4 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM5: clear
c712f11n04: DIMM6 Product Serial Number: 10 cd a9 4f (   O)
c712f11n04: DIMM6 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM6 Product Version: 34 54 (4T)
c712f11n04: DIMM6 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM7: clear
c712f11n04: DIMM8 Product Serial Number: 41 17 b6 49 (A  I)
c712f11n04: DIMM8 Product Manufacturer: ad 80 (  )
c712f11n04: DIMM8 Product Version: 34 54 (4T)
c712f11n04: DIMM8 Product Model: HMT42GR7BFR4A-PB
c712f11n04: DIMM9: clear
c712f11n04: Membuf Board Additional Info: ECID:019A0083400857180B11000000E40341
c712f11n04: Membuf Board Additional Info: ECID:019A0083400857180D11000000E20391
c712f11n04: Membuf Board Additional Info: ECID:019A0083400857180A08000000ED0421
c712f11n04: Membuf Board Additional Info: ECID:019A0083400857180808000000BF04D1
c712f11n04: PSU1 Product Serial Number: HCVD1536002982
c712f11n04: PSU1 Product Manufacturer: DELTA
c712f11n04: PSU1 Product Version: 012
c712f11n04: PSU1 Product Model: 00F
c712f11n04: PSU2 Product Serial Number: HCVD1536002931
c712f11n04: PSU2 Product Manufacturer: DELTA
c712f11n04: PSU2 Product Version: 012
c712f11n04: PSU2 Product Model: 00F
c712f11n04: System Firmware Product Version: IBM-habanero-ibm-OP8_v1.7_1.62
c712f11n04: System Firmware Product Additional Info: hostboot-bc98d0b-1a29dff
c712f11n04: System Firmware Product Additional Info: occ-0362706-16fdfa7
c712f11n04: System Firmware Product Additional Info: skiboot-5.1.13
c712f11n04: System Firmware Product Additional Info: hostboot-binaries-43d5a59
c712f11n04: System Firmware Product Additional Info: habanero-xml-a71550e-cdd3b31
c712f11n04: System Firmware Product Additional Info: capp-ucode-105cb8f 
whowutwut commented 7 years ago

@xuweibj Are you currently working on this? which team are we borrowing these nodes from?

xuweibj commented 7 years ago

@whowutwut I haven't started my work on this. I will discuss with Ertao whether fix it in this sprint.

whowutwut commented 7 years ago

I was just wondering who owns the c712 system? I am not aware of any Habaneros that we have in our possession.

cxhong commented 7 years ago

The Habaneros system is used fru id 43 to get firmware information, our current ipmi.pm code used fru id 47.

# For openpower servers, fru 3 is used to get MTM/Serial information, fru 47 is used to get firmware information
            @{$sessdata->{frus_for_openpower}} = qw(3 47);

fru 47 and fru 43 output from Habaneros system:

[root@c712ems5 ~]# ipmitool-xcat -I lanplus -H 10.128.130.24 -U ADMIN -P admin fru print 47
 Device not present (Requested sensor, data, or record not found)
[root@c712ems5 ~]# ipmitool-xcat -I lanplus -H 10.128.130.24 -U ADMIN -P admin fru print 43
 Product Name          : OpenPOWER Firmware
 Product Version       : IBM-habanero-ibm-OP8_v1.7_1.62
 Product Extra         :        hostboot-bc98d0b-1a29dff
 Product Extra         :        occ-0362706-16fdfa7
 Product Extra         :        skiboot-5.1.13
 Product Extra         :        hostboot-binaries-43d5a59
 Product Extra         :        habanero-xml-a71550e-cdd3b31
 Product Extra         :        capp-ucode-105cb8f

fru 47 and fru 43 output from garrison system:

[root@fs4 ~]# ipmitool-xcat -I lanplus -H 50.23.17.1 -U ADMIN -P admin fru print 43
 Product Manufacturer  : ad80
 Product Name          : 0c
 Product Part Number   : HMA451R7AFR8N-TF
 Product Version       : 00
 Product Serial        : 71103d5d
[root@fs4 ~]# ipmitool-xcat -I lanplus -H 50.23.17.1 -U ADMIN -P admin fru print 47
 Product Name          : OpenPOWER Firmware
 Product Version       : IBM-garrison-ibm-OP8_v1.11_2.19
 Product Extra         :        op-build-6ce5903
 Product Extra         :        buildroot-81b8d98
 Product Extra         :        skiboot-5.3.7
 Product Extra         :        hostboot-1f6784d-02b09df
 Product Extra         :        linux-4.4.24-openpower1-5d537af
 Product Extra         :        petitboot-v1.2.6-8fa93f2
 Product Extra         :        garrison-xml-3db7b6e
 Product Extra         :        occ-69fb587
 Product Extra         :        hostboot-bina
zet809 commented 7 years ago

@cxhong Thx for your information, casandra, we need to think over how to fix it.

whowutwut commented 7 years ago

@zet809 Please clear the assignee if they are not working on it, we have it in the backlog to pull from.

whowutwut commented 7 years ago

There are some systems that @pdlun92 manages that we can use to fix this problem. If it's a low hanging fruit, I suggest that we fix this..

whowutwut commented 7 years ago

So looks like we would need to ensure that all the BMC related options are working. It seems like doing a rinv <node> without any options are OK, but once we get into a sub command, it fails. serial, firm, vpd, etc.... Maybe we have to dual path this, detect a Tyan MFG BMC and then support the different FRUs? @cxhong @zet809 ?

cxhong commented 7 years ago

For habanero system, the fru id should be 2 and 43 instead of 3 and 47

# ipmitool-xcat -I lanplus -H 10.128.130.24 -U ADMIN -P admin fru print 2
 Chassis Type          : Unknown
 Chassis Part Number   : 8348-21C
 Chassis Serial        : 843F02A
 Board Mfg Date        : Sun Dec 31 19:00:00 1995
 Board Mfg             : TYAN
 Board Product         : SYSTEM BACKPLANE
 Board Serial          : CUDD2AF3805M
 Board Part Number     : SP012GMR-IGX
# ipmitool-xcat -I lanplus -H 10.128.130.24 -U ADMIN -P admin fru print 43
 Product Name          : OpenPOWER Firmware
 Product Version       : IBM-habanero-ibm-OP8_v1.7_1.62
 Product Extra         :        hostboot-bc98d0b-1a29dff
 Product Extra         :        occ-0362706-16fdfa7
 Product Extra         :        skiboot-5.1.13
 Product Extra         :        hostboot-binaries-43d5a59
 Product Extra         :        habanero-xml-a71550e-cdd3b31
 Product Extra         :        capp-ucode-105cb8f
xuweibj commented 7 years ago

Verified