Open jlec opened 1 year ago
This is a bug with U-Boot, not Talos. The former is configuring an inappropriate power management mode for the CM4's PCI Express root controller, which causes the TP2's ASM1061 to drop off the bus, resulting in a CPU abort when U-Boot attempts to scan it.
I'll be sending a patch for this to U-Boot shortly. This issue can be closed once Talos begins using a fixed U-Boot.
@CFSworks could you add a link to the U-Boot PR here, so we can track that progress here as well?
Oh! Yes, certainly, here you go: pci: pcie-brcmstb: do not rely on CLKREQ# signal
It looks like it will first be included in v2024.01-rc1
.
Note that a fixed U-Boot allows the boot to progress a little further and encounter a kernel panic from Linux's version of the same problem. There's a patch out there that works (everyone I've seen encounter that panic who then tried that patch reported success with it), but it apparently hasn't yet been accepted into the kernel.
The new version of u-boot (v2024.01
) has been released two weeks ago. Any ETA to have Talos released with it?
@SheGe I'm not sure about ETAs, but the new U-boot version requires lots of SBC compatibility changes so might be time-consuming to test. No idea whether the kernel patch would be available either
The bug triggers also if you add a mini PCI-Express to Sata adapter to node 1 or 2.
should this be fixed now with https://github.com/siderolabs/sbc-raspberrypi/pull/7 and is this merge already released?
See https://www.talos.dev/v1.7/talos-guides/install/single-board-computers/rpi_generic/, try with latest version (v1.7.0-beta.0) at the moment.
I just tried the latest version. I get past uboot and grub. But the linux kernel still seems to crash.
@torvitas the kernel patch hasn't been accepted yet, but v8 was close and v9 was submitted a few weeks ago. Talos uses LTS kernel versions, usually the last stable release in a year, so this might not be fixed for a while. It's planned to be fixed in the Turing Pi v2.5 though.
As far as I understood the Turing Pi v2.5 is a different piece of hardware than the Turing Pi v2.
Is there a way to have custom kernel patches for the sbc images?
Yeah, v2.5 is a new hardware revision. Overlays don't include kernel patches, but the docs have a guide for customising the kernel. Then you can use an overlay with the custom imager to generate boot assets.
I just want to report that it works. All three cm4 modules on my tp2 are up and running.
I just want to report that it works. All three cm4 modules on my tp2 are up and running.
@torvitas Would you mind to quickly share how you did it? Last time I tried it, I wasn't successful.
@tuxpeople - I basically did what @pl4nty suggested.
It was roughly like that: Clone the pkgs repo, add patches to kernel/prepare/patches
, references these in kernel/pkg.yaml
. Then follow https://www.talos.dev/v1.7/advanced/customizing-the-kernel/ .
Bug Report
The node 3 is connect to an ASM1061 chip. This is apparent the reason the node is not booting. Most likely cause is the boot order which prefers the SATA disks over the eMMC.
Description
The Adapter and CM4 have been validated in other slots to work with the same Talos image to work and boot normally. Just in node 3 slot it won't come up. Other linux distros like Ubuntu work fine with the same board.
Logs
serial.log:
Environment