Open sarg3nt opened 6 months ago
Based on the logs provided, it seems there is an issue with adding a line to the bashrc files on the system. Specifically, the line:
export PATH="${PATH}:/var/lib/rancher/rke2/bin/"
is failing to be added. This can be seen in the '/home/kairos/.bashrc' and '/home/kairos/.profile' files. The error message:
/bin/sh: line 1: [[: 0[0]: syntax error: invalid arithmetic operator (error token is "[0]")
indicates a syntax error in the bash script.
It is recommended to manually check the files '/home/kairos/.bashrc' and '/home/kairos/.profile' to confirm whether the line in question is present and if there are any syntax errors.
Additionally, ensure that the '/var/lib/rancher/rke2/bin/' directory exists and has executable files.
Once you have confirmed these details, you may remove the offending line from the bashrc files as it does not appear to be a necessary addition.
If the issue persists after these steps, further investigation into the code and environment may be needed.
Regarding what @ci-robbot noticed. The lines that add the path statement are fine. Both /home/kairos/.bashrc
and /home/kairos/.profile
are fine. The problem was some left over logging code that was not working. I removed it and reran the test, did not help, which I figured would be the case as this was the same code that worked fine in bios
mode.
Could you provide the updated logs after removing the logging code and rerunning the test?
@ci-robbot Okey dokey Mr bot sir.
[root@lpul-vault-k8s-server-0 immucore]# cat initramfs_stage.log
2024-05-03T00:35:19Z INF Running stage: initramfs.before
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ ! -f /oem/userdata ]: exit status 1)' stage name: Pull data from provider
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ -e /sbin/openrc ]: exit status 1)' stage name: Blacklist bpfilter on Alpine ( bug: https://github.com/kairos-io/kairos/issues/277 )
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run ! [[ -f /etc/hosts ]] || ! [[ $(grep '127.0.0.1' /etc/hosts) ]]
: exit status 1)' stage name: Make sure hosts file is present and includes a record for 127.0.0.1
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ ! -f /oem/userdata ]: exit status 1)' stage name:
2024-05-03T00:35:19Z INF Done executing stage 'initramfs.before'
2024-05-03T00:35:19Z INF Running stage: initramfs
2024-05-03T00:35:19Z INF Processing stage step ''. ( commands: 1, files: 0, ... )
2024-05-03T00:35:19Z INF Processing stage step 'Enable systemd-network config files for DHCP'. ( commands: 1, files: 2, ... )
2024-05-03T00:35:19Z INF Processing stage step 'systemd-sysext initramfs settings'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run grep -q "kairos.remote_recovery_mode" /proc/cmdline && \
( [ -e "/sbin/systemctl" ] || [ -e "/usr/bin/systemctl" ] || [ -e "/usr/sbin/systemctl" ] || [ -e "/usr/bin/systemctl" ] )
: exit status 1)' stage name: Starts kairos-recovery and generate a temporary pass
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ -f "/sbin/openrc" ]
: exit status 1)' stage name: Create OpenRC services
2024-05-03T00:35:19Z INF Processing stage step 'Create journalctl /var/log/journal dir'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:19Z ERR Failed to connect system bus: No such file or directory
: failed to run networkctl reload: exit status 1
2024-05-03T00:35:19Z ERR 1 error occurred:
* failed to run networkctl reload: exit status 1
2024-05-03T00:35:19Z INF Command output: Created symlink /etc/systemd/system/multi-user.target.wants/kairos-agent.service → /etc/systemd/system/kairos-agent.service.
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ ! -f "/run/cos/recovery_mode" ] && [ -s /usr/local/etc/machine-id ]: exit status 1)' stage name: Restore /etc/machine-id for systemd systems
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ ! -f "/run/cos/recovery_mode" ] && [ ! -f "/run/cos/live_mode" ]: exit status 1)' stage name:
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ -f "/sbin/openrc" ]
: exit status 1)' stage name: Enable OpenRC services
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run grep -q "kairos.remote_recovery_mode" /proc/cmdline && [ -f "/sbin/openrc" ]: exit status 1)' stage name: Starts kairos-recovery for openRC based systems
2024-05-03T00:35:19Z INF Processing stage step 'Disable NetworkManager and wicked'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:19Z INF Processing stage step ''. ( commands: 0, files: 2, ... )
2024-05-03T00:35:19Z ERR 2 errors occurred:
* failed to run systemctl disable NetworkManager: exit status 1
* failed to run systemctl disable wicked: exit status 1
2024-05-03T00:35:19Z INF Processing stage step 'Enable systemd-network and systemd-resolved'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ ! -f "/run/cos/recovery_mode" ] && [ -f "/sbin/openrc" ]: exit status 1)' stage name: Restore /etc/machine-id for openrc systems
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run (grep -qv "interactive-install" /proc/cmdline || grep -qv "install-mode-interactive" /proc/cmdline) && \
[ -f /run/cos/live_mode ] && \
[ -f "/sbin/openrc" ]
: exit status 1)' stage name: Autologin on livecd for OpenRC
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run (grep -q "kairos.reset" /proc/cmdline || [ -f /run/cos/autoreset_mode ]) && \
( [ -e "/sbin/systemctl" ] || [ -e "/usr/bin/systemctl" ] || [ -e "/usr/sbin/systemctl" ] || [ -e "/usr/bin/systemctl" ] )
: exit status 1)' stage name: Starts kairos-reset for systemd based systems
2024-05-03T00:35:19Z INF Processing stage step 'Default systemd config'. ( commands: 1, files: 0, ... )
2024-05-03T00:35:19Z INF Command output: Created symlink /etc/systemd/system/default.target → /usr/lib/systemd/system/multi-user.target.
2024-05-03T00:35:19Z ERR 5 errors occurred:
* failed to run systemctl enable systemd-timesyncd: exit status 1
* failed to run systemctl enable nohang: exit status 1
* failed to run systemctl enable nohang-desktop: exit status 1
* failed to run systemctl enable fail2ban: exit status 1
* failed to run systemctl enable logrotate.timer: exit status 1
2024-05-03T00:35:19Z INF Processing stage step 'Generate host keys'. ( commands: 1, files: 0, ... )
2024-05-03T00:35:19Z INF Processing stage step 'Link /etc/resolv.conf to systemd resolv.conf'. ( commands: 2, files: 0, ... )
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run grep -q "kairos.reset" /proc/cmdline && [ -f "/sbin/openrc" ]: exit status 1)' stage name: Starts kairos-reset for openRC-based systems
2024-05-03T00:35:19Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run cat /proc/cmdline | grep "selinux=1"
: exit status 1)' stage name: Relabelling
2024-05-03T00:35:19Z INF Command output:
2024-05-03T00:35:19Z INF Command output:
2024-05-03T00:35:20Z INF Command output: ssh-keygen: generating new host keys: RSA DSA ECDSA ED25519
2024-05-03T00:35:20Z INF Processing stage step 'Create systemd services'. ( commands: 0, files: 5, ... )
2024-05-03T00:35:20Z INF Processing stage step ''. ( commands: 5, files: 0, ... )
2024-05-03T00:35:20Z INF Command output: Removed "/etc/systemd/system/getty.target.wants/getty@tty1.service".
2024-05-03T00:35:20Z INF Command output: Running in chroot, ignoring command 'stop'
2024-05-03T00:35:20Z INF Command output: Created symlink /etc/systemd/system/getty@tty1.service → /dev/null.
2024-05-03T00:35:20Z INF Command output: Created symlink /etc/systemd/system/multi-user.target.wants/kairos.service → /etc/systemd/system/kairos.service.
2024-05-03T00:35:20Z INF Command output: Created symlink /etc/systemd/system/multi-user.target.wants/kairos-webui.service → /etc/systemd/system/kairos-webui.service.
2024-05-03T00:35:20Z INF Processing stage step 'Enable systemd services'. ( commands: 4, files: 0, ... )
2024-05-03T00:35:20Z INF Command output:
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run (grep -q "install-mode" /proc/cmdline || grep -q "nodepair.enable" /proc/cmdline ) && \
([ -f /run/cos/live_mode ] || [ -f /run/cos/uki_install_mode ]) && \
[ -f "/sbin/openrc" ]
: exit status 1)' stage name:
2024-05-03T00:35:20Z INF Command output:
2024-05-03T00:35:20Z INF Command output:
2024-05-03T00:35:20Z INF Command output:
2024-05-03T00:35:20Z INF Processing stage step 'Setup groups'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run (grep -q "interactive-install" /proc/cmdline || grep -q "install-mode-interactive" /proc/cmdline) && \
([ -f /run/cos/live_mode ] || [ -f /run/cos/uki_install_mode ]) && \
( [ -e "/sbin/systemctl" ] || [ -e "/usr/bin/systemctl" ] || [ -e "/usr/sbin/systemctl" ] || [ -e "/usr/bin/systemctl" ] )
: exit status 1)' stage name:
2024-05-03T00:35:20Z INF Processing stage step 'Setup users'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run (grep -q "interactive-install" /proc/cmdline || grep -q "install-mode-interactive" /proc/cmdline) && \
([ -f /run/cos/live_mode ] || [ -f /run/cos/uki_install_mode ]) && \
[ -f "/sbin/openrc" ]
: exit status 1)' stage name:
2024-05-03T00:35:20Z INF Processing stage step 'Set user password if running in live or uki'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:20Z INF Processing stage step 'Setup sudo'. ( commands: 1, files: 1, ... )
2024-05-03T00:35:20Z INF Command output: Locking password for user root.
passwd: Success
2024-05-03T00:35:20Z INF Processing stage step 'Ensure runtime permission'. ( commands: 2, files: 0, ... )
2024-05-03T00:35:20Z INF Command output:
2024-05-03T00:35:20Z INF Command output:
2024-05-03T00:35:20Z INF Processing stage step ''. ( commands: 0, files: 0, ... )
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ -e "/usr/local/cloud-config" ]: exit status 1)' stage name: Ensure runtime permission
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ -f "/sys/firmware/devicetree/base/model" ] && grep -i jetson "/sys/firmware/devicetree/base/model"
: exit status 1)' stage name: Create files
2024-05-03T00:35:20Z INF Processing stage step ''. ( commands: 0, files: 0, ... )
2024-05-03T00:35:20Z INF Processing stage step 'Set hostname'. ( commands: 0, files: 0, ... )
2024-05-03T00:35:20Z INF Processing stage step 'Run commands'. ( commands: 1, files: 0, ... )
2024-05-03T00:35:20Z INF Command output: 2024-05-03 00:35:20 Add DHCP ClientIdentifier=mac to network config if not already present.
2024-05-03 00:35:20 Adding line [DHCP] to file /etc/systemd/network/20-dhcp.network
2024-05-03 00:35:20 Adding line ClientIdentifier=mac to file /etc/systemd/network/20-dhcp.network
2024-05-03 00:35:20 Adding line [DHCP] to file /etc/systemd/network/20-dhcp-legacy.network
2024-05-03 00:35:20 Adding line ClientIdentifier=mac to file /etc/systemd/network/20-dhcp-legacy.network
2024-05-03 00:35:20 Add ll to the root and Kairos .bashrc if not already present.
2024-05-03 00:35:20 Adding line alias ll="ls -alh" to file /root/.bashrc
2024-05-03 00:35:20 Creating new file /home/kairos/.bashrc with line alias ll="ls -alh"
2024-05-03 00:35:20 Creating new file /home/kairos/.profile with line alias ll="ls -alh"
2024-05-03 00:35:20 Add rke2 bin to the path.
2024-05-03 00:35:20 Adding line export PATH="${PATH}:/var/lib/rancher/rke2/bin/" to file /root/.bashrc
2024-05-03 00:35:20 Adding line export PATH="${PATH}:/var/lib/rancher/rke2/bin/" to file /home/kairos/.bashrc
2024-05-03 00:35:20 Adding line export PATH="${PATH}:/var/lib/rancher/rke2/bin/" to file /home/kairos/.profile
2024-05-03T00:35:20Z INF Done executing stage 'initramfs'
2024-05-03T00:35:20Z INF Running stage: initramfs.after
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [ -e /sbin/openrc ]: exit status 1)' stage name: Enable serial login for alpine
2024-05-03T00:35:20Z WRN (conditional) Skip 'Skipping stage (if statement error: failed to run [[ $(kairos-agent state get kairos.flavor) =~ ^ubuntu ]]: exit status 1)' stage name: setupcon initramfs.after ubuntu
2024-05-03T00:35:20Z INF Done executing stage 'initramfs.after'
2024-05-03T00:35:20Z INF Running stage: initramfs.before
2024-05-03T00:35:20Z INF Done executing stage 'initramfs.before'
2024-05-03T00:35:20Z INF Running stage: initramfs
2024-05-03T00:35:20Z INF Done executing stage 'initramfs'
2024-05-03T00:35:20Z INF Running stage: initramfs.after
2024-05-03T00:35:20Z INF Done executing stage 'initramfs.after'
Error in script: /init: 22: /init: /sbin/touch: not found
Tried building an ISO of our current customized OS and installing it manually on a vsphere node with the firmware set to efi and I get the following error:
I tried BIOS and it worked fine. Am I missing something?
Our custom container image starts with /kairos/rockylinux:9-core-amd64-generic-master
and installs a few packages, rke2 and some security software. Nothing that would dork over the kernel that I can think of. Especially not in a way that would break it for EFI but not BIOS.
Here's the build command. It's right out of the docs.
#!/bin/bash
docker run -v "$PWD"/build:/tmp/auroraboot \
-v /var/run/docker.sock:/var/run/docker.sock \
--rm -ti quay.artifactory.metro.ad.selinc.com/kairos/auroraboot:v0.2.7 \
--set container_image=docker://<snip> \
--set "disable_http_server=true" \
--set "disable_netboot=true" \
--set "state_dir=/tmp/auroraboot"
It seems there was an issue with the installation process. The /var/log/installer/syslog
and /run/immucore/rootfs_stage.log
logs may provide information about the errors encountered during the installation.
In the /var/log/installer/syslog
log, you can see several errors related to connecting to the system bus and reloading the network configuration. It's possible that the installation environment is not fully set up, or there is a problem with the networking configuration.
In the /run/immucore/rootfs_stage.log
, you can see errors related to creating directories and connecting to the system bus. It's also mentioned that the installation process is skipping certain stages due to various if statements not matching the current conditions.
To troubleshoot the issue, you can try the following steps:
If the issue persists, it may be helpful to seek further assistance from the software's official support channels or community forums, providing the relevant logs and details about your system configuration.
Kairos version:
A build from Master from
kairos/rockylinux:9-core-amd64-generic-master
on April 29th, 2024CPU architecture, OS, and Version:
Intel, vSphere VM, rocky linux 9 core with add-ons.
Describe the bug
Works when VM is in BIOS mode but when switched to EFI (VSpheres term for the firmware) it crashes on install.
To Reproduce
You don't have access to vSphere so . . . . let me be your hands. :)
Expected behavior
It works?
Additional context
no-format: true
set and are creating our own boot partitions for Kairos, however switching this back to "normal" has no effect. It looks like it's not getting to this part of the config at all.kairos/rockylinux:9-core-amd64-generic-master
. Master because the fixes to makeno-format
work are not in a release yet. The custom changes addrke2
and some security tools.Logs
Debug mode is enabled with
AuroraBoot
AuroraBoot is serving all the files properly and it looks like the downstream VM is getting them.
Node is stopping on startup:
Setting
auto: false
andreboot: false
does not change anything but I'm able to SSH to the partially booted node. Some customization's are being ran. i.e. the machine name is being set by thecloud_init.yaml
file that is served to the node via vSphere customization. From first inspection it looks likeinitramfs
is being ran but the node password is not being set, nor are theboot
orwrite_files
steps. See: https://github.com/kairos-io/kairos/issues/2281#issuecomment-2078014965 The files in that post are the relevant bits. and https://github.com/kairos-io/kairos/issues/2516 for context.journalctl -u kairos-agent
Logs in
/run/kairos
There are a bunch of log files in
/run/kairos
but as far as I can tell they all say the same thing./run/immucore/immucore.log
/run/immucore/initramfs_stage.log
/run/immucore/rootfs_stage.log