thomas-krenn / check_lsi_raid

Monitoring plugin to check MegaRAID controllers
GNU General Public License v3.0
59 stars 27 forks source link

Error: invalid controller number, controller not found! / show time issue #15

Closed jirib closed 6 years ago

jirib commented 6 years ago

Hello,

it seems there's no caught an issue when 'show time' shows unexpected output.

# ./check_lsi_raid -p /opt/MegaRAID/storcli/storcli64
Error: invalid controller number, controller not found!

Well, my controller got completely crazy...:

# storcli /c0 show time ; echo $?
Controller = 0
Status = Failure
Description = None

Detailed Status :
===============

-------------------------------------------------------
Ctrl Status Ctrl_Prop Value ErrMsg               ErrCd 
-------------------------------------------------------
   0 Failed Time      -     CTRL_TIME_GET failed    49 
-------------------------------------------------------

49

Complete output of 'show all':

Generating detailed summary of the adapter, it may take a while to complete.

Controller = 0
Status = Success
Description = None

Basics :
======
Controller = 0
Model = ServeRAID M5015 SAS/SATA Controller
Serial Number = SV03201033
Current Controller Date/Time = 08/26/2135, 02:24:31
Current System Date/time = 02/08/2018, 11:06:29
SAS Address = 500605b002827b50
PCI Address = 00:01:00:00
Mfg Date = 08/01/10
Rework Date = 00/00/00
Revision No = 

Version :
=======
Firmware Package Build = 12.0.1-0090
Firmware Version = 2.0.13-0748
Bios Version = 3.09.00
Preboot CLI Version = 02.00-015:#%00008
WebBIOS Version = 3.0-22-e_12-Rel
NVDATA Version = 2.02.0038
Boot Block Version = 2.00.00.00-0018
Bootloader Version = 01.250.04.219
Driver Name = megaraid_sas
Driver Version = 07.701.17.00-rh1

Bus :
===
Vendor Id = 0x1000
Device Id = 0x79
SubVendor Id = 0x1014
SubDevice Id = 0x3B2
Host Interface = PCI-E
Device Interface = SAS-6G
Bus Number = 1
Device Number = 0
Function Number = 0

Pending Images in Flash :
=======================
Image name = No pending images

Status :
======
Controller Status = Needs Attention
Memory Correctable Errors = 0
Memory Uncorrectable Errors = 0
ECC Bucket Count = 0
Any Offline VD Cache Preserved = No
BBU Status = 32
Support PD Firmware Download = No
Lock Key Assigned = No
Failed to get lock key on bootup = No
Lock key has not been backed up = No
Bios was not detected during boot = No
Controller must be rebooted to complete security operation = No
A rollback operation is in progress = No
At least one PFK exists in NVRAM = No
SSC Policy is WB = No
Controller has booted into safe mode = No

Supported Adapter Operations :
============================
Rebuild Rate = Yes
CC Rate = Yes
BGI Rate  = Yes
Reconstruct Rate = Yes
Patrol Read Rate = Yes
Alarm Control = Yes
Cluster Support = No
BBU  = Yes
Spanning = Yes
Dedicated Hot Spare = Yes
Revertible Hot Spares = Yes
Foreign Config Import = Yes
Self Diagnostic = Yes
Allow Mixed Redundancy on Array = No
Global Hot Spares = Yes
Deny SCSI Passthrough = No
Deny SMP Passthrough = No
Deny STP Passthrough = No
Support more than 8 Phys = Yes
FW and Event Time in GMT = No
Support Enhanced Foreign Import = Yes
Support Enclosure Enumeration = Yes
Support Allowed Operations = Yes
Abort CC on Error = Yes
Support Multipath = Yes
Support Odd & Even Drive count in RAID1E = No
Support Security = No
Support Config Page Model = Yes
Support the OCE without adding drives = No
Support EKM = No
Snapshot Enabled = No
Support PFK = No
Support PI = No
Support Ld BBM Info = No
Support Shield State = No
Block SSD Write Disk Cache Change = No
Support Suspend Resume BG ops = No
Support Emergency Spares = No
Support Set Link Speed = No
Support Boot Time PFK Change = No
Support JBOD = No
Disable Online PFK Change = No
Support Perf Tuning = No
Support SSD PatrolRead = No
Real Time Scheduler = No
Support Reset Now = No
Support Emulated Drives = No
Headless Mode = No
Dedicated HotSpares Limited = No
Point In Time Progress = No
Extended LD = No
Boot Volume Supported = No
Support Uneven span  = No
Support Config Auto Balance = No
Support Maintenance Mode = No
Support Diagnostic results = No
Support Ext Enclosure = No
Support Sesmonitoring = No
Support SecurityonJBOD = No
Support ForceFlash = No
Support DisableImmediateIO = No
Support DrvActivityLEDSetting = No
Support CPLDUpdate = No
Support ForceTo512e = No
Support discardCacheDuringLDDelete = No
Support JBOD Write cache = No

Supported PD Operations :
=======================
Force Online = Yes
Force Offline = Yes
Force Rebuild = Yes
Deny Force Failed = No
Deny Force Good/Bad = No
Deny Missing Replace = No
Deny Clear = No
Deny Locate = No
Support Power State = Yes
Set Power State For Cfg = Yes
Support T10 Power State = No
Support Temperature = No
NCQ = No
Support Max Rate SATA = No

Supported VD Operations :
=======================
Read Policy = Yes
Write Policy = Yes
IO Policy = Yes
Access Policy = Yes
Disk Cache Policy = Yes
Reconstruction = Yes
Deny Locate = No
Deny CC = No
Allow Ctrl Encryption = No
Enable LDBBM = No
Support FastPath = No
Performance Metrics = No
Power Savings = No
Support Powersave Max With Cache = No
Support Breakmirror = No
Support SSC WriteBack = No
Support SSC Association = No
Support VD Hide = No
Support VD Cachebypass = No
Support VD discardCacheDuringLDDelete = No

HwCfg :
=====
ChipRevision =  
BatteryFRU = N/A
Front End Port Count = 0
Backend Port Count = 8
BBU = Present
Alarm = Disable
Serial Debugger = Present
NVRAM Size = 32KB
Flash Size = 8MB
On Board Memory Size = 512MB
CacheVault Flash Size = NA
TPM = Absent
Upgrade Key = Absent
On Board Expander = Absent
Temperature Sensor for ROC = Absent
Temperature Sensor for Controller = Absent
Upgradable CPLD = Absent
Current Size of CacheCade (GB) = 0
Current Size of FW Cache (MB) = 0

Policies :
========

Policies Table :
==============

------------------------------------------------
Policy                          Current Default 
------------------------------------------------
Predictive Fail Poll Interval   300 sec         
Interrupt Throttle Active Count 16              
Interrupt Throttle Completion   50 us           
Rebuild Rate                    30 %    30%     
PR Rate                         30 %    30%     
BGI Rate                        30 %    30%     
Check Consistency Rate          30 %    30%     
Reconstruction Rate             30 %    30%     
Cache Flush Interval            4s              
------------------------------------------------

Flush Time(Default) = 4s
Drive Coercion Mode = 1GB
Auto Rebuild = On
Battery Warning = On
ECC Bucket Size = 15
ECC Bucket Leak Rate (hrs) = 24
Restore HotSpare on Insertion = Off
Expose Enclosure Devices = On
Maintain PD Fail History = On
Reorder Host Requests = On
Auto detect BackPlane = SGPIO/i2c SEP
Load Balance Mode = Auto
Security Key Assigned = Off
Disable Online Controller Reset = Off
Use drive activity for locate = Off

Boot :
====
BIOS Enumerate VDs = 1
Stop BIOS on Error = On
Delay during POST = 4
Spin Down Mode = None
Enable Ctrl-R = No
Enable Web BIOS = Yes
Enable PreBoot CLI = Yes
Enable BIOS = Yes
Max Drives to Spinup at One Time = 2
Maximum number of direct attached drives to spin up in 1 min = 0
Delay Among Spinup Groups (sec) = 12
Allow Boot with Preserved Cache = Off

High Availability :
=================
Topology Type = None
Cluster Permitted = No
Cluster Active = No

Defaults :
========
Phy Polarity = 0
Phy PolaritySplit = 0
Strip Size = 128kB
Write Policy = WB
Read Policy = No Read Ahead
Cache When BBU Bad = Off
Cached IO = Off
VD PowerSave Policy = Controller Defined
Default spin down time (mins) = 0
Coercion Mode = 1 GB
ZCR Config = Unknown
Max Chained Enclosures = 16
Direct PD Mapping = No
Restore Hot Spare on Insertion = No
Expose Enclosure Devices = Yes
Maintain PD Fail History = Yes
Zero Based Enclosure Enumeration = No
Disable Puncturing = Yes
EnableLDBBM = No
DisableHII = No
Un-Certified Hard Disk Drives = Allow
SMART Mode = Mode 6
Enable LED Header = No
LED Show Drive Activity = No
Dirty LED Shows Drive Activity = No
EnableCrashDump = No
Disable Online Controller Reset = No
Treat Single span R1E as R10 = No
Power Saving option = Enabled
TTY Log In Flash = No
Auto Enhanced Import = No
BreakMirror RAID Support = No
Disable Join Mirror = No
Enable Shield State = No
Time taken to detect CME = 60 sec

Capabilities :
============
Supported Drives = SAS, SATA
Boot Volume Supported = NO
RAID Level Supported = RAID0, RAID1, RAID5, RAID00, RAID10, RAID50, 
, PRL 11, PRL 11 with spanning, SRL 3 supported, 
PRL11-RLQ0 DDF layout with no span, PRL11-RLQ0 DDF layout with span
Enable JBOD = No
Mix in Enclosure = Allowed
Mix of SAS/SATA of HDD type in VD = Not Allowed
Mix of SAS/SATA of SSD type in VD = Not Allowed
Mix of SSD/HDD in VD = Not Allowed
SAS Disable = No
Max Arms Per VD = 32
Max Spans Per VD = 8
Max Arrays = 128
Max VD per array = 16
Max Number of VDs = 64
Max Parallel Commands = 1008
Max SGE Count = 80
Max Data Transfer Size = 8192 sectors
Max Strips PerIO = 42
Max Configurable CacheCade Size(GB) = 0
Min Strip Size = 8 KB
Max Strip Size = 1.0 MB

Scheduled Tasks :
===============
Consistency Check Reoccurrence = 168 hrs
Next Consistency check launch = 04/15/2017, 03:00:00
Patrol Read Reoccurrence = 168 hrs
Next Patrol Read launch = 04/15/2017, 03:00:00
Battery learn Reoccurrence = 720 hrs
Next Battery Learn = 04/15/2017, 10:00:00
OEMID = IBM

Drive Groups = 1

TOPOLOGY :
========

--------------------------------------------------------------------------
DG Arr Row EID:Slot DID Type  State BT       Size PDC  PI SED DS3  FSpace 
--------------------------------------------------------------------------
 0 -   -   -        -   RAID1 Dgrd  N  464.729 GB enbl N  N   dflt N      
 0 0   -   -        -   RAID1 Dgrd  N  464.729 GB enbl N  N   dflt N      
 0 0   0   252:0    11  DRIVE Offln N  464.729 GB enbl N  N   dflt -      
 0 0   1   252:0    13  DRIVE Onln  N  464.729 GB enbl N  N   dflt -      
--------------------------------------------------------------------------

DG=Disk Group Index|Arr=Array Index|Row=Row Index|EID=Enclosure Device ID
DID=Device ID|Type=Drive Type|Onln=Online|Rbld=Rebuild|Dgrd=Degraded
Pdgd=Partially degraded|Offln=Offline|BT=Background Task Active
PDC=PD Cache|PI=Protection Info|SED=Self Encrypting Drive|Frgn=Foreign
DS3=Dimmer Switch 3|dflt=Default|Msng=Missing|FSpace=Free Space Present

Virtual Drives = 1

VD LIST :
=======

-----------------------------------------------------------
DG/VD TYPE  State Access Consist Cache sCC       Size Name 
-----------------------------------------------------------
0/0   RAID1 Dgrd  RW     Yes     RWTC  -   464.729 GB      
-----------------------------------------------------------

Cac=CacheCade|Rec=Recovery|OfLn=OffLine|Pdgd=Partially Degraded|dgrd=Degraded
Optl=Optimal|RO=Read Only|RW=Read Write|HD=Hidden|B=Blocked|Consist=Consistent|
R=Read Ahead Always|NR=No Read Ahead|WB=WriteBack|
AWB=Always WriteBack|WT=WriteThrough|C=Cached IO|D=Direct IO|sCC=Scheduled
Check Consistency

Physical Drives = 8

PD LIST :
=======

------------------------------------------------------------------------------------------------
EID:Slt DID State DG       Size Intf Med SED PI SeSz Model                                   Sp 
------------------------------------------------------------------------------------------------
252:0    11 Offln  0 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
252:0    13 Onln   0 464.729 GB SATA HDD N   N  512B ST9500620NS          81Y9715 81Y3856IBM U  
252:1    10 UGood  - 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
252:1    12 UGood  - 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
252:2     9 UGood  - 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
252:2    14 UGood  - 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
252:3     8 UGood  - 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
252:3    15 UBad   - 464.729 GB SATA HDD N   N  512B ST9500530NS          42D0743 42D0746IBM U  
------------------------------------------------------------------------------------------------

EID-Enclosure Device ID|Slt-Slot No.|DID-Device ID|DG-DriveGroup
DHS-Dedicated Hot Spare|UGood-Unconfigured Good|GHS-Global Hotspare
UBad-Unconfigured Bad|Onln-Online|Offln-Offline|Intf-Interface
Med-Media Type|SED-Self Encryptive Drive|PI-Protection Info
SeSz-Sector Size|Sp-Spun|U-Up|D-Down|T-Transition|F-Foreign
UGUnsp-Unsupported|UGShld-UnConfigured shielded|HSPShld-Hotspare shielded
CFShld-Configured shielded|Cpybck-CopyBack|CBShld-Copyback Shielded

BBU_Info :
========

------------------------------------------------------------------------------------
Model State                 RetentionTime Temp Mode MfgDate    Next Learn           
------------------------------------------------------------------------------------
iBBU  Dgd (Needs Attention) 0 hour(s)     34C  -    2010/06/12 2017/04/15  10:49:17 
------------------------------------------------------------------------------------
jirib commented 6 years ago

The controller is crazy = check EIE:Slt for fun :)

gschoenberger commented 6 years ago

Closing this issue as the controller seems not to be running sane...

nenominal commented 4 years ago

The controller is crazy = check EIE:Slt for fun :)

I have the same issue. Could you please be more specific?

Thanks!

jirib commented 4 years ago

The controller is crazy = check EIE:Slt for fun :)

I have the same issue. Could you please be more specific?

Thanks!

I don't have access to this hardware anymore. @pkubica ?

Fadder-76149 commented 3 years ago

Please check the summary on screen, as you can see: `Current Controller Date/Time = 08/26/2135, 02:24:31

Current System Date/time = 02/08/2018, 11:06:29`

Try to sync the controllers time: "storcli /call set time=systemtime" If done, the script will work like a charme.

Cheers