Closed kills closed 5 months ago
Just to add to the above, created a new bootloader and migrated the data to a DS3622xs and its successfully rebuilding and scrubbing without any freezing. I retained the existing configurations when migrating. So likely there is something up with the SA6400 configuration.
The logs you provided don't seem to show any clues about a kernel panic directly about mpt3sas. I often see raid456, so I am suspicious of this module. First, let's check the latest version of rr's mpt3sas module and include it again in sa6400.
Thank you for your dedication to this project Peter.
Just to add, the above errors were from my main NAS.
I have built a test NAS on some spare parts I had available, including an Avago mpt3sas raid card with 5 disks attached. Happy to test any beta or potential new drivers if you like.
I was interested to see how RR performs and grabbed the latest RR today (24.1.0) and have that running a SA6400 configuration for the past 6 hours, no crashes so far. I will continue to monitor and report back any findings.
Did you end up seeing any differences between the driver versions?
Update: Scrubbing completed successfully without any freezes or errors.
In the following content included in the last release of Mshell: Extract mpt3sas.ko and
https://github.com/PeterSuh-Q3/arpl-modules/releases/download/v1.64/epyc7002-7.2-5.10.55.tgz
I extracted the same mpt3sas.ko from the rr 24.1.0 integrated module and compared it. Unfortunately, nothing has changed.
I also have Dell Perc H200, H310 cards that use the same chipset as yours. Since it has been confirmed that SHELL is using the same module, we will install the disk and monitor its stability for a few days.
Today, in mshell, we made it almost identical to the mechanism of rr so that the HBA operates stably. All HBAs that did not work until now will work. If possible, please test it.
Hi Peter,
Thank you for this awesome loader. I wanted to let you know I've run into some strange freezing and lock ups, while trying to run raid rebuilds and data scrubbing using SA6400.
Is it possible that the MPT3SAS Drivers have some issues still? Below are some of the logs I captured:
Also captured some other messages:
I have seen some of the other loaders, RR & ARC, mention they've updated their MPT3SAS Drivers recently due to some issues with the SA6400. Is it possible that there could be a bug with the current ones in your tinycore loader?
Appreciate any help you can provide.
Thank you,
Kills