RedPill-TTG / redpill-lkm

Linux kernel module for RedPill
GNU General Public License v3.0

DSM version 918+_7.0.1_42218: storage pool creation prevented, health status "Not Supported" #26

Open ranydb opened 2 years ago

ranydb commented 2 years ago

DSM version: 918+_7.0.1_42218
HBA card: LSI 9400 16i
Problem: storage pool creation is prevented, and the health status shows "Not Supported".

[Screenshots: WeChat screenshot 20211015121355, WeChat screenshot 20211015121535]

However, running smartctl in a terminal returns normal information:

root@test:/etc# smartctl -a /dev/sdj
smartctl 6.5 (build date Feb 20 2021) [x86_64-linux-4.4.180+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Ultrastar
Device Model:     WDC  WUH721818ALE6L4
Serial Number:    3WKVUKLK
LU WWN Device Id: 5 000cca 284f67823
Firmware Version: PCGNW232
User Capacity:    18,000,207,937,536 bytes [18.0 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   Unknown(0x0ffc) (unknown minor revision code: 0x009c)
SATA Version is:  SATA >3.2 (0x1ff), 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Fri Oct 15 12:16:49 2021 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x80) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  101) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        (1894) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME                                                   FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate                                              0x000b   100   100   001    Pre-fail  Always       -       0
  2 Throughput_Performance                                           0x0005   100   100   054    Pre-fail  Offline      -       0
  3 Spin_Up_Time                                                     0x0007   099   099   001    Pre-fail  Always       -       25769803845
  4 Start/Stop_Count                                                 0x0012   100   100   000    Old_age   Always       -       1
  5 Reallocated_Sector_Count                                         0x0033   100   100   001    Pre-fail  Always       -       0
  7 Seek_Error_Rate                                                  0x000b   100   100   001    Pre-fail  Always       -       0
  8 Seek_Time_Performance                                            0x0005   100   100   020    Pre-fail  Offline      -       0
  9 Power-On_Hours_Count                                             0x0012   100   100   000    Old_age   Always       -       17
 10 Spin_Retry_Count                                                 0x0013   100   100   001    Pre-fail  Always       -       0
 12 Device_Power_Cycle_Count                                         0x0032   100   100   000    Old_age   Always       -       1
 22 Internal_Environment_Status                                      0x0023   100   100   025    Pre-fail  Always       -       100
192 Power_off_Retract_Count                                          0x0032   100   100   000    Old_age   Always       -       5
193 Load_Cycle_Count                                                 0x0012   100   100   000    Old_age   Always       -       5
194 Temperature                                                      0x0002   046   046   000    Old_age   Always       -       46 (Min/Max 25/47)
196 Reallocation_Event_Count                                         0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector_Count                                     0x0022   100   100   000    Old_age   Always       -       0
198 Off-Line_Scan_Uncorrectable_Sector_Count                         0x0008   100   100   000    Old_age   Offline      -       0
199 Ultra_DMA_CRC_Error_Count                                        0x000a   100   100   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

So, any ideas?

Originally posted by @ranydb in https://github.com/RedPill-TTG/redpill-lkm/issues/14#issuecomment-943972532

ranydb commented 2 years ago

Log from /var/log/messages:

2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdk] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]
2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: StorageManager.cpp:957 Failed to get disk /dev/sdk's serial
2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: disk/disk_is_mv_soc_driver.c:71 Can't get sata chip name from pattern /sys/block/sdl/device/../../scsi_host/host*/proc_name
2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: disk/disk_is_mv1475_driver.c:71 Can't get sata chip name from pattern /sys/block/sdl/device/../../scsi_host/host*/proc_name
2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: DoSmartFirmAndSerialRead(219) AtaSmartFirmAndSerialRead fail
2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdl] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]
2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: StorageManager.cpp:957 Failed to get disk /dev/sdl's serial
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: disk/disk_is_mv_soc_driver.c:71 Can't get sata chip name from pattern /sys/block/sdk/device/../../scsi_host/host*/proc_name
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: disk/disk_is_mv1475_driver.c:71 Can't get sata chip name from pattern /sys/block/sdk/device/../../scsi_host/host*/proc_name
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: DoSmartFirmAndSerialRead(219) AtaSmartFirmAndSerialRead fail
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdk] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: StorageManager.cpp:957 Failed to get disk /dev/sdk's serial
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: disk/disk_is_mv_soc_driver.c:71 Can't get sata chip name from pattern /sys/block/sdl/device/../../scsi_host/host*/proc_name
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: disk/disk_is_mv1475_driver.c:71 Can't get sata chip name from pattern /sys/block/sdl/device/../../scsi_host/host*/proc_name
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: DoSmartFirmAndSerialRead(219) AtaSmartFirmAndSerialRead fail
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdl] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]
2021-10-16T12:59:29+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[17862]: StorageManager.cpp:957 Failed to get disk /dev/sdl's serial
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: disk/disk_is_mv_soc_driver.c:71 Can't get sata chip name from pattern /sys/block/sdk/device/../../scsi_host/host*/proc_name
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: disk/disk_is_mv1475_driver.c:71 Can't get sata chip name from pattern /sys/block/sdk/device/../../scsi_host/host*/proc_name
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: DoSmartFirmAndSerialRead(219) AtaSmartFirmAndSerialRead fail
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdk] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: StorageManager.cpp:957 Failed to get disk /dev/sdk's serial
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: disk/disk_is_mv_soc_driver.c:71 Can't get sata chip name from pattern /sys/block/sdl/device/../../scsi_host/host*/proc_name
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: disk/disk_is_mv1475_driver.c:71 Can't get sata chip name from pattern /sys/block/sdl/device/../../scsi_host/host*/proc_name
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: DoSmartFirmAndSerialRead(219) AtaSmartFirmAndSerialRead fail
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdl] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]
2021-10-16T13:06:03+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[20079]: StorageManager.cpp:957 Failed to get disk /dev/sdl's serial
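
The repeated entries above can be condensed into a per-device count of failed serial reads. The following is only a minimal sketch, assuming the message format stays exactly as shown in this log; `summarize_failures` and `FAILED_READ` are hypothetical names for illustration and are not part of DSM or RedPill.

```python
import re
from collections import Counter

# Matches the "SmartFirmAndSerialRead failed on [/dev/sdX]" messages seen in the log above.
# Assumption: the device name is always a plain /dev/sd* path in square brackets.
FAILED_READ = re.compile(r"SmartFirmAndSerialRead failed on \[(/dev/[a-z]+)\]")


def summarize_failures(lines):
    """Count failed firmware/serial reads per device (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        match = FAILED_READ.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


# Sample lines copied from the log above.
sample = [
    "2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdk] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]",
    "2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: StorageManager.cpp:957 Failed to get disk /dev/sdk's serial",
    "2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: DoSmartFirmAndSerialRead(219) AtaSmartFirmAndSerialRead fail",
    "2021-10-16T12:47:52+08:00 test entry.cgi_SYNO.Storage.CGI.Storage_1_get_sche_task_list[15313]: disk/disk_serial_get.c:35 SmartFirmAndSerialRead failed on [/dev/sdl] [0x6C00 smartctl/smartctl_ata_command_interface.c:135]",
]

if __name__ == "__main__":
    for device, count in sorted(summarize_failures(sample).items()):
        print(device, count)
```

To run it against the real log, pass an open file instead of the sample list, for example `summarize_failures(open("/var/log/messages"))`. Only the `disk_serial_get.c` lines are counted, so each failed read is tallied once even though it produces several log entries.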

swanduron commented 2 years ago

The same issue occurs on my testbed: all disks are marked with an access error, and I can't create a storage pool.

pocopico commented 2 years ago

Which version of redpill are you using? Are you sure you compiled from the latest source? This looks like an old error.

swanduron commented 2 years ago

Hello, and thanks for your help. My test environment runs on ESXi 7.0u3, and I followed the guide at the link below:

https://github.com/tossp/redpill-tool-chain/blob/master/README_EN.md