fbelavenuto / arpl

Automated Redpill Loader
GNU General Public License v3.0
7.15k stars 1.53k forks source link

Can't boot with more than 2 SATA drives #449

Closed sunhater closed 1 year ago

sunhater commented 1 year ago

I'm trying to boot DSM 7.1 on real hardware with 6 SATA ports. ASRock Z370M-ITX/ac. It boots without any problem when one or two drives only are connected no matter where the drives are connected. But when I connect more than two drives, the DSM won't boot (I wait more than hour without ping). I have additional SATA card that supports 6 more drives. "Show SATA(s) # ports and drives" recognises all connected drives (both on motherboard and sata card). My goal is to boot with 10 connected drives but for now it's impossible. I tried with DS3615xs, DS3617xs and DS3622xs+ without success. Is there any boot log to figure out what is happening?

sunhater commented 1 year ago

I have a little progress. The only way to boot and have more than two drives is to install and boot DSM with two connected drives and then connect the rest (without shutdown). Hotplug setting must be enabled in BIOS (for every port). But it is too annoying to disconnect drives before boot and connect them when DSM loads.

hdj8812 commented 1 year ago

I also encountered this problem

linlinlin530 commented 1 year ago

I encounter this issue too, can't boot with 4 HDD but success with 2 HDD.

DSM 7.1.1-42962 Update 3 ASRock B460M Steellegend Intel Core i5-10500ES (QSRK)

crazyb123 commented 1 year ago

I have same problem. You can try 1.0 beta5. Currently I use this version,and upgrade update3.

linlinlin530 commented 1 year ago

I have same problem. You can try 1.0 beta5. Currently I use this version,and upgrade update3.

Yes I remember that things could work properly in early version, and I am going to downgrade arpl and have a try.

chyzp commented 1 year ago

I am using arpl-1.0-beta9, and i cant boot with 6 disks bath in pve or physics mechine.

dasb00ter commented 1 year ago

I think I also have the same issue. arpl-1.0-beta9 ds918+ Baremetal on g3 800 sff Hewlett packard board with 4 sata spinners and 1 ssd. 2 spinners and ssd are plugged into the motherboard. 2 spinners are connected to jmb585 sata expansion card in a pcie slot stuck on booting screen and cannot ping the server

edit:cold booted it after hard shutdown and it started without issue with all drives plugged in so not sure

fbelavenuto commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

crazyb123 commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

I used 1.0 beta9,When I finish configuring,reboot. It cannot creat log files. Same settings,I used beta5,USB flash drive root directory have logs folder.

sunhater commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

I do exactly what you say and there is no such folder in any partition. (arpl-1.0-beta9)

root@arpl:/opt/arpl# mount
none on / type rootfs (rw)
devtmpfs on /dev type devtmpfs (rw,relatime)
proc on /proc type proc (rw,relatime)
devpts on /dev/pts type devpts (rw,relatime,gid=5,mode=620,ptmxmode=666)
sysfs on /sys type sysfs (rw,relatime)
/dev/sdg1 on /mnt/p1 type vfat (rw,relatime,fmask=0022,dmask=0022,codepage=437,iocharset=iso8859-1,shortname=mixed,errors=remount-ro)
/dev/sdg2 on /mnt/p2 type ext4 (rw,relatime)
/dev/sdg3 on /mnt/p3 type ext4 (rw,relatime)
root@arpl:/opt/arpl# ls -la /mnt/p1
total 8
drwxr-xr-x 5 root root  512 Jan  1  1970 .
drwxr-xr-x 6 root root    0 Jan  2 13:21 ..
drwxr-xr-x 2 root root  512 Jan  2  2023 .fseventsd
-rwxr-xr-x 1 root root   10 Dec 13 14:25 ARPL-VERSION
drwxr-xr-x 3 root root  512 Dec 13 14:25 EFI
-rwxr-xr-x 1 root root  102 Jan  2 13:16 GRUB_VER
drwxr-xr-x 5 root root  512 Dec 13 14:25 grub
-rwxr-xr-x 1 root root   54 Jan  2 13:16 grub_cksum.syno
-rwxr-xr-x 1 root root 3840 Jan  2 13:21 user-config.yml
root@arpl:/opt/arpl# ls -la /mnt/p2
total 10332
drwxr-xr-x 3 root root    4096 Jan  2 13:16 .
drwxr-xr-x 6 root root       0 Jan  2 13:21 ..
-rw-r--r-- 1 root root     102 Jan  2 13:16 GRUB_VER
-rw-r--r-- 1 root root      54 Jan  2 13:16 grub_cksum.syno
drwx------ 2 root root   16384 Dec 13 14:25 lost+found
-rw-r--r-- 1 root root 7017652 Jan  2 13:16 rd.gz
-rw-r--r-- 1 root root 3513856 Jan  2 13:16 zImage
root@arpl:/opt/arpl# ls -la /mnt/p3
total 140308
drwxr-xr-x  9 root root     4096 Jan  2 13:17 .
drwxr-xr-x  6 root root        0 Jan  2 13:21 ..
-rw-------  1 root root      119 Jan  2 13:24 .bash_history
drwxr-xr-x 13 root root     4096 Dec 13 14:25 addons
-rw-r--r--  1 root root  6110016 Dec 13 14:25 bzImage-arpl
drwxr-xr-x  2 root root     4096 Jan  2 13:16 dl
drwxr-xr-x  2 root root     4096 Jan  2 13:16 extractor
-rw-r--r--  1 root root 49161088 Dec 13 14:25 initrd-arpl
-rw-r--r--  1 root root 72483840 Jan  2 13:17 initrd-dsm
drwxr-xr-x  2 root root     4096 Dec 13 14:25 lkms
drwx------  2 root root    16384 Dec 13 14:25 lost+found
drwxr-xr-x  2 root root     4096 Dec 13 14:25 modules
drwxr-xr-x  2 root root     4096 Jan  2 13:12 ssh
-rw-r--r--  1 root root 15859712 Jan  2 13:17 zImage-dsm
root@arpl:/opt/arpl#
AuxXxilium commented 1 year ago

@fbelavenuto if this will help, i went back to old eudev addon and the system is booting with more than 2 disks but only in sata mode and not with scsi/raid/hba.

with your new eudev: logfile says that all addons are loaded and disks there. but system doesn't boot. have tried nearly everthing the last days.

if i go back to beta5 incl. lkms and addons, the system will boot.

fbelavenuto commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

I used 1.0 beta9,When I finish configuring,reboot. It cannot creat log files. Same settings,I used beta5,USB flash drive root directory have logs folder.

Ok, thanks for report, I'll check the dbgutils log if something break.

fbelavenuto commented 1 year ago

@fbelavenuto if this will help, i went back to old eudev addon and the system is booting with more than 2 disks but only in sata mode and not with scsi/raid/hba.

with your new eudev: logfile says that all addons are loaded and disks there. but system doesn't boot. have tried nearly everthing the last days.

if i go back to beta5 incl. lkms and addons, the system will boot.

Thanks for report. Try to update LKM and addons now, rebuild the loader and test it.

AuxXxilium commented 1 year ago

@fbelavenuto if this will help, i went back to old eudev addon and the system is booting with more than 2 disks but only in sata mode and not with scsi/raid/hba. with your new eudev: logfile says that all addons are loaded and disks there. but system doesn't boot. have tried nearly everthing the last days. if i go back to beta5 incl. lkms and addons, the system will boot.

Thanks for report. Try to update LKM and addons now, rebuild the loader and test it.

Thanks for your work, but it isn't working for me. Same issue with more than 2 drives connected. but hotplug drives after poweron is working. serial (4).zip

edit: found the issue, it only doesn't boot with pvscsi. it is booting with sata only. thanks and it should work for the most with your fixes.

linlinlin530 commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

Hi, I did what you said but there was no such logs folder created into the first partition (beta9). So I try to burn beta5 into usb-flash but still can't boot with 4 SATA disks, then I try beta4, which I could properly run DSM for a period of time before, but failed too. I can't figure out what is the real problem now, I think there is something else causing the problem, I'll try later.

AuxXxilium commented 1 year ago

update: Sata is working for me baremetal and vm. SCSI and Raid doesn't work. even with right sataportmap. so it looks like there is another issue. logs are clear and there is nothing special.

fbelavenuto commented 1 year ago

update: Sata is working for me baremetal and vm. SCSI and Raid doesn't work. even with right sataportmap. so it looks like there is another issue. logs are clear and there is nothing special.

What is the DSM model choosed?

AuxXxilium commented 1 year ago

both broadwellnk. in post above i added the output from serial console.

dtech77pl commented 1 year ago

With the beta10 still, the same problem exists, I can boot only with two HDDs

dtech77pl commented 1 year ago

With beta10a the still problem exist.

dtech77pl commented 1 year ago

on the 10b still the same, can boot only with 2 HDDs

fbelavenuto commented 1 year ago

Please update ARPL, reboot, update modules, addons and lkm, rebuild the loader and test it.

AuxXxilium commented 1 year ago

@fbelavenuto It is the supportraidgroups no in model config. without this tag, all broadwellnk will work. all model config without this, were working since you gone back to your old docker image.

fbelavenuto commented 1 year ago

@fbelavenuto It is the supportraidgroups no in model config. without this tag, all broadwellnk will work. all model config without this, were working since you gone back to your old docker image.

Thanks for the tip. DS3622xs+ works without "supportraidgroups" but RS4021xs+ not! Without "supportraidgroups" the RS4021xs+ do not show SHR RAID option!

AuxXxilium commented 1 year ago

@fbelavenuto It is the supportraidgroups no in model config. without this tag, all broadwellnk will work. all model config without this, were working since you gone back to your old docker image.

Thanks for the tip. DS3622xs+ works without "supportraidgroups" but RS4021xs+ not! Without "supportraidgroups" the RS4021xs+ do not show SHR RAID option!

i know but it destroyes the raid/scsi functions. so we have to choose what we want. shr is not the best option in my eyes. i know many user want it. i think we can find another way for shr on 4021.

dtech77pl commented 1 year ago

Please update ARPL, reboot, update modules, addons and lkm, rebuild the loader and test it.

I follow this instruction and still the problem exists, can boot only if 2HDDs are attached.

fbelavenuto commented 1 year ago

Thanks for the test

AuxXxilium commented 1 year ago

Thanks for the test

just an information: RS4021xs+ doesn't support SHR from Syno side. It is disabled. I bought today one for testing.

image from syno attached. sorry that is german. but in the upper line you see all models without shr support from syno.

image

fbelavenuto commented 1 year ago

Thanks for the test

just an information: RS4021xs+ doesn't support SHR from Syno side. It is disabled. I bought today one for testing.

image from syno attached. sorry that is german. but in the upper line you see all models without shr support from syno.

image

Thanks, I didn't know!

yanyisio commented 1 year ago

beta11a Can't boot with more than 2 SATA drives This problem still exists

Beta 3 it's ok, no probem

luyvlei commented 1 year ago

Same problem when I use ds918+. But ds920+ 7.0.1-42218 works well. While the latest 7.1.1 of ds920+ also boot failed. My hardware is GIGABYTE z490i+i3-10100

aniel300 commented 1 year ago

so this is the issue i originally had and still having right now 😭. have u figured out what the issue is ? @fbelavenuto

fbelavenuto commented 1 year ago

I still don't know what it is, I haven't been able to reproduce the problem on the machines I have access to.

dasb00ter commented 1 year ago

I'm sorry if this is not helpful. My machine did this initially(I let it sit for a couple of hours trying to booti)and since then has restarted every time with 4 disks and an SSD. I remember Jun's loader would do something similar on a completely different machine I had once upon a time. Maybe one of the old timey Devs could give a clue. Still if u can't reproduce it, that's tough.

linlinlin530 commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

Hi, I did what you said but there was no such logs folder created into the first partition (beta9). So I try to burn beta5 into usb-flash but still can't boot with 4 SATA disks, then I try beta4, which I could properly run DSM for a period of time before, but failed too. I can't figure out what is the real problem now, I think there is something else causing the problem, I'll try later.

@fbelavenuto Finally it works on beta3 and my machine has been online for a couple of days, reboot is available and all things run properly. Don't know if this would help.

fbelavenuto commented 1 year ago

@fbelavenuto Finally it works on beta3 and my machine has been online for a couple of days, reboot is available and all things run properly. Don't know if this would help.

Ok, thanks for the report

aniel300 commented 1 year ago

some logs from latest beta logs.zip

crazyb123 commented 1 year ago

@fbelavenuto Tested the latest version,it can not boot. When I use the beta5 eudev module replace it,the latest version can be used. When the supportraidgroup tag is removed,Create a new storage pool,SHR is gone.

DS3622xs+ E3-1265Lv3 asus B85M-E(6 SATA ports.)

aniel300 commented 1 year ago

@fbelavenuto any update on this ?

luyvlei commented 1 year ago

Hi, please go to loader, add the "dbgutils" addon, try to install the DSM with more than 2 disks, wait the problem, shutdown the machine and get the logs from logs folder into first partition of usb flash drive.

Hi, I did what you said but there was no such logs folder created into the first partition (beta9). So I try to burn beta5 into usb-flash but still can't boot with 4 SATA disks, then I try beta4, which I could properly run DSM for a period of time before, but failed too. I can't figure out what is the real problem now, I think there is something else causing the problem, I'll try later.

@fbelavenuto Finally it works on beta3 and my machine has been online for a couple of days, reboot is available and all things run properly. Don't know if this would help.

I tried beta3 and succeeded, thanks.

AuxXxilium commented 1 year ago

@fbelavenuto Tested the latest version,it can not boot. When I use the beta5 eudev module replace it,the latest version can be used. When the supportraidgroup tag is removed,Create a new storage pool,SHR is gone.

DS3622xs+ E3-1265Lv3 asus B85M-E(6 SATA ports.)

DS3622xs+ and RS4021xs+ doesn't support SHR from Synology side.

You can look here: https://global.download.synology.com/download/Document/Hardware/DataSheet/DiskStation/22-year/DS3622xs+/enu/Synology_DS3622xs+_Data_Sheet_enu.pdf

Or here is an overview: https://kb.synology.com/en-ca/DSM/tutorial/Which_models_have_limited_support_for_Synology_Hybrid_RAID_SHR

To make this clear, these models had SHR support in an ARPL release, but Synology hasn't enabled it for these models. If we add SHR support, we will loose any support for SCSI/HBA/Raid controller.

crazyb123 commented 1 year ago

@fbelavenuto Tested the latest version,it can not boot. When I use the beta5 eudev module replace it,the latest version can be used. When the supportraidgroup tag is removed,Create a new storage pool,SHR is gone. DS3622xs+ E3-1265Lv3 asus B85M-E(6 SATA ports.)

DS3622xs+ and RS4021xs+ doesn't support SHR from Synology side.

You can look here: https://global.download.synology.com/download/Document/Hardware/DataSheet/DiskStation/22-year/DS3622xs+/enu/Synology_DS3622xs+_Data_Sheet_enu.pdf

Or here is an overview: https://kb.synology.com/en-ca/DSM/tutorial/Which_models_have_limited_support_for_Synology_Hybrid_RAID_SHR

To make this clear, these models had SHR support in an ARPL release, but Synology hasn't enabled it for these models. If we add SHR support, we will loose any support for SCSI/HBA/Raid controller.

Okay, got it, thanks! But when I delete supportraidgroup ,it still can't boot,

AuxXxilium commented 1 year ago

@fbelavenuto this should fix it: https://github.com/AuxXxilium/arc-addons/commit/f152d1ab0dfd3a62113569f689d68f61e712e827

aniel300 commented 1 year ago

@AuxXxilium how do u know ?

AuxXxilium commented 1 year ago

@AuxXxilium how do u know ?

i testet many different things, to fix the bugs that i found in serial log. without this line the eudev will not load in the right way.

found this fix in systemd community for 4.4.y linux kernel and it is working since 4 days in about 150 different systems.

fbelavenuto commented 1 year ago

@fbelavenuto this should fix it: AuxXxilium/arc-addons@f152d1a

Very thanks!!

fbelavenuto commented 1 year ago

Guys, update the addons, rebuild the loader and test it.

aniel300 commented 1 year ago

can u add an option to update addons, lkm, etc and also skip already downloaded/updated files in one go ?

aniel300 commented 1 year ago

i won't be able to test until couple days time

AuxXxilium commented 1 year ago

can u add an option to update addons, lkm, etc and also skip already downloaded/updated files in one go ?

Update menu in ARPL and update addons.