ravynsoft / ravynos

A BSD-based OS project that aims to provide source and binary compatibility with macOS® and a similar user experience.
https://www.ravynos.com
Other
5.53k stars 183 forks source link

v0.4.0pre4 - /bin/install.sh issues #298

Closed markshank closed 1 year ago

markshank commented 1 year ago

Kernel Panic after boot from SSD

The startup after boot is a hit or miss situation. It might make it to the login screen, and I can logon to the desktop. Or not.

This kernel panic happened after the startup had frozen. Could not ctrl+alt+F2 (or F3). So I held down the power button until it responded with this kernel panic.

IMG_20220717_100407275 - Copy

markshank commented 1 year ago

Frozen

This time the startup logged a bunch of ldconfig garbage and then locked up. In this case, holding down the power button resulted in the screen going black when it finally powered off.

NOTE Sometimes it logs the ldconfig garbage and presses on to the login screen. And sometimes the ldconfig garbage looks like it might come from a copyright text file.

IMG_20220717_105218949 - Copy

mszoek commented 1 year ago

Did we ever figure this out? :)

markshank commented 1 year ago

No

markshank commented 1 year ago

I just booted the most recent nightly (2022-Aug-21 15:18 ravynOS_0.4.0pre5_f14_5654957775912960_amd64.iso) in a libvirt virtual machine on Ubuntu and the ldconfig: lines scrolled continuously on the serial console for minutes before finally stopping. It filled the screen buffer so I can't show all of it.

serial console buffer ldconfig: raminit:13168: trailing characters ignored ldconfig: Kf: No such file or directory ldconfig: raminit:13169: trailing characters ignored ldconfig: �: No such file or directory ldconfig: raminit:13170: trailing characters ignored ldconfig: K<: No such file or directory ldconfig: Kf0.J,: No such file or directory ldconfig: K: No such file or directory ldconfig: raminit:13174: trailing characters ignored ldconfig: �U<�J��J: No such file or directory ldconfig: ��UX: No such file or directory ldconfig: raminit:13176: trailing characters ignored ldconfig: �: No such file or directory ldconfig: raminit:13177: trailing characters ignored ldconfig: gt/t6t: No such file or directory ldconfig: Kf/�0Y: No such file or directory ldconfig:=�/: No such file or directory ldconfig: K: No such file or directory ldconfig: �t�~.�<��~.�J1<3�}��t�<�.: No such file or directory ldconfig: K: No such file or directory ldconfig: K�/�: No such file or directory ldconfig: �_aY: No such file or directory ldconfig: raminit:13185: trailing characters ignored ldconfig: Kf/�: No such file or directory ldconfig: raminit:13186: trailing characters ignored ldconfig: uJf/�: No such file or directory ldconfig: raminit:13187: trailing characters ignored ldconfig: tJf/�: No such file or directory ldconfig: Kf/�2Y: No such file or directory ldconfig: Kf/�2Y: No such file or directory ldconfig: raminit:13190: trailing characters ignored ldconfig: Kf".0.-f.1XK: No such file or directory ldconfig: raminit:13300: trailing characters ignored ldconfig: �u�J��<�X�f<�f�f<�}tf�f��}���f��}��#�f��}��#�f��}��#�f��}��#�f��}��#�f��}��#�f��}����<�2�: File name too long ldconfig: raminit:13403: trailing characters ignored ldconfig: &<�.X�|.�<�<%/3.: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�.�J��: No such file or directory ldconfig: raminit:13411: trailing characters ignored ldconfig: ?�z��h�X�ֺ�ztJ��zf�t�"�X�ֺ�zf�tt�ztJ��zf�t�"�X�ֺ�zf�tt�ztJ��zf�t�"�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ��|f�~X��af�/+i"f�m�Z.<(=mXXf�~J: File name too long ldconfig:XJ,f��: No such file or directory ldconfig:XJ,f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig: raminit:13423: trailing characters ignored ldconfig: �yf�tXaff/+�"f�m�Z.<(=mXX.�~f: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig: �: No such file or directory ldconfig: �J/f��: No such file or directory ldconfig: �tJ�}f�tX/+w"X�m�L: No such file or directory ldconfig: raminit:13598: trailing characters ignored ldconfig: x�: No such file or directory ldconfig: �X/�<: No such file or directory ldconfig: raminit:13600: trailing characters ignored ldconfig: f.J�. metric 0 mtu 16384 options=680003 inet6 ::1 prefixlen 128 inet6 fe80::1%lo0 prefixlen 64 scopeid 0x2 inet 127.0.0.1 netmask 0xff000000 groups: lo nd6 options=23 vtnet0: flags=8863 metric 0 mtu 1500 options=4c07bb ether 52:54:00:ac:a2:1d inet6 fe80::5054:ff:feac:a21d%vtnet0 prefixlen 64 scopeid 0x1 inet6 2601:5c4:c500:2209:5054:ff:feac:a21d prefixlen 64 autoconf inet 192.168.1.43 netmask 0xffffff00 broadcast 192.168.1.255 media: Ethernet autoselect (10Gbase-T ) status: active nd6 options=23 Starting devd. add host 127.0.0.1: gateway lo0 fib 0: route already in table add host ::1: gateway lo0 fib 0: route already in table add net fe80::: gateway ::1 add net ff02::: gateway ::1 add net ::ffff:0.0.0.0: gateway ::1 add net ::0.0.0.0: gateway ::1 Creating and/or trimming log files. devfs rule: error converting to integer: cups /etc/rc: WARNING: devfs_init_rulesets: could not read rules from /etc/devfs.rules Clearing /tmp. Updating motd:. Updating /var/run/os-release done. Starting psm0 moused. Starting syslogd. Setting date via ntp. 23 Aug 18:32:36 ntpdate[1564]: step time server 108.62.122.57 offset +14400.143149 sec kldload: can't load i915kms: module already loaded or in kernel PlatformLang is en Could not read EFI variable PlatformLang setxkbmap needs to be run from within a graphical user session --> tzsetup America/New_York --> Wrote /var/localize/include Starting dbus. Starting seatd. Mounting late filesystems:. Starting avahi-daemon. Configuring vt: allscreens_kbd blanktime. sysctl: hw.acpi.lid_switch_state="S3": value is not available at line 34 sysctl: unknown oid 'compat.linux.osrelease' at line 39 Starting background file system checks in 60 seconds. Tue Aug 23 14:32:38 EDT 2022
mszoek commented 1 year ago

This doesn't make sense. ldconfig here would be /etc/rc.d/ldconfig during the rc boot sequence. A broken ISO should affect everyone. Why are you the only one seeing this?

It looks like the system does eventually boot. Can you log in? What does grep ldconfig /etc/rc.conf show? If you run sudo /etc/rc.d/ldconfig start does it spew garbage again?

markshank commented 1 year ago

Why are you the only one seeing this?

The original issue was on my Chromebook. It has no serial console, so it roars through the ldconfig: garbage much quicker than the emulated 115200 bps serial console.

It looks like the system does eventually boot. Can you log in?

Not this time, neither the graphic console nor the serial console respond to an Enter key press. Ctrl+Alt+F2 does switch to alternate console, but Enter provokes nothing. Virtual Machine Manager shows the single processor chugging along at approx 50%.

If I tried enough times, I might get lucky based on my original experience with the Chromebook. But it takes so long and the CPU is pegged while it is spewing the ldconfig: stuff.

image

image

serial console buffer ldconfig: raminit:13168: trailing characters ignored ldconfig: Kf: No such file or directory ldconfig: raminit:13169: trailing characters ignored ldconfig: �: No such file or directory ldconfig: raminit:13170: trailing characters ignored ldconfig: K<: No such file or directory ldconfig: Kf0.J,: No such file or directory ldconfig: K: No such file or directory ldconfig: raminit:13174: trailing characters ignored ldconfig: �U<�J��J: No such file or directory ldconfig: ��UX: No such file or directory ldconfig: raminit:13176: trailing characters ignored ldconfig: �: No such file or directory ldconfig: raminit:13177: trailing characters ignored ldconfig: gt/t6t: No such file or directory ldconfig: Kf/�0Y: No such file or directory ldconfig:=�/: No such file or directory ldconfig: K: No such file or directory ldconfig: �t�~.�<��~.�J1<3�}��t�<�.: No such file or directory ldconfig: K: No such file or directory ldconfig: K�/�: No such file or directory ldconfig: �_aY: No such file or directory ldconfig: raminit:13185: trailing characters ignored ldconfig: Kf/�: No such file or directory ldconfig: raminit:13186: trailing characters ignored ldconfig: uJf/�: No such file or directory ldconfig: raminit:13187: trailing characters ignored ldconfig: tJf/�: No such file or directory ldconfig: Kf/�2Y: No such file or directory ldconfig: Kf/�2Y: No such file or directory ldconfig: raminit:13190: trailing characters ignored ldconfig: Kf".0.-f.1XK: No such file or directory ldconfig: raminit:13300: trailing characters ignored ldconfig: �u�J��<�X�f<�f�f<�}tf�f��}���f��}��#�f��}��#�f��}��#�f��}��#�f��}��#�f��}��#�f��}����<�2�: File name too long ldconfig: raminit:13403: trailing characters ignored ldconfig: &<�.X�|.�<�<%/3.: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�.�J��: No such file or directory ldconfig: raminit:13411: trailing characters ignored ldconfig: ?�z��h�X�ֺ�ztJ��zf�t�"�X�ֺ�zf�tt�ztJ��zf�t�"�X�ֺ�zf�tt�ztJ��zf�t�"�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�g�X�ֺ�zf�tt�ztJ�h�X�ֺ�zf�tt�ztJ��|f�~X��af�/+i"f�m�Z.<(=mXXf�~J: File name too long ldconfig:XJ,f��: No such file or directory ldconfig:XJ,f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig: raminit:13423: trailing characters ignored ldconfig: �yf�tXaff/+�"f�m�Z.<(=mXX.�~f: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ�: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig:XJ/f��: No such file or directory ldconfig: �: No such file or directory ldconfig: �J/f��: No such file or directory ldconfig: �tJ�}f�tX/+w"X�m�L: No such file or directory ldconfig: raminit:13598: trailing characters ignored ldconfig: x�: No such file or directory ldconfig: �X/�<: No such file or directory ldconfig: raminit:13600: trailing characters ignored ldconfig: f.J�. metric 0 mtu 16384 options=680003 inet6 ::1 prefixlen 128 inet6 fe80::1%lo0 prefixlen 64 scopeid 0x2 inet 127.0.0.1 netmask 0xff000000 groups: lo nd6 options=23 vtnet0: flags=8863 metric 0 mtu 1500 options=4c07bb ether 52:54:00:ac:a2:1d inet6 fe80::5054:ff:feac:a21d%vtnet0 prefixlen 64 scopeid 0x1 inet6 2601:5c4:c500:2209:5054:ff:feac:a21d prefixlen 64 autoconf inet 192.168.1.43 netmask 0xffffff00 broadcast 192.168.1.255 media: Ethernet autoselect (10Gbase-T ) status: active nd6 options=23 Starting devd. add host 127.0.0.1: gateway lo0 fib 0: route already in table add host ::1: gateway lo0 fib 0: route already in table add net fe80::: gateway ::1 add net ff02::: gateway ::1 add net ::ffff:0.0.0.0: gateway ::1 add net ::0.0.0.0: gateway ::1 Creating and/or trimming log files. devfs rule: error converting to integer: cups /etc/rc: WARNING: devfs_init_rulesets: could not read rules from /etc/devfs.rules Clearing /tmp. Updating motd:. Updating /var/run/os-release done. Starting psm0 moused. Starting syslogd. Setting date via ntp. 24 Aug 10:53:28 ntpdate[1564]: step time server 69.89.207.199 offset +14400.145274 sec kldload: can't load i915kms: module already loaded or in kernel PlatformLang is en Could not read EFI variable PlatformLang setxkbmap needs to be run from within a graphical user session --> tzsetup America/New_York --> Wrote /var/localize/include Starting dbus. Starting seatd. Mounting late filesystems:. Starting avahi-daemon. Configuring vt: allscreens_kbd blanktime. sysctl: hw.acpi.lid_switch_state="S3": value is not available at line 34 sysctl: unknown oid 'compat.linux.osrelease' at line 39 Starting background file system checks in 60 seconds. Wed Aug 24 06:53:28 EDT 2022
markshank commented 1 year ago

The mediated passthrough of the Intel GPU may be introducing a special case. Especially since i915kms.ko thinks it has a physical GPU which causes it to throw all kinds of errors when the mediated passthrough GPU doesn't cooperate with it.

Still doesn't explain the original Chromebook issue though.

mszoek commented 1 year ago

Have you verified the image MD5 sum? It feels like something is corrupted but I can't think what it could be. ldconfig appears to be reading the contents of /raminit (a binary file) and version.txt among other things. It should only do that if those files are listed either in rc.conf or are in a directory listed in rc.conf or where it normally looks.

markshank commented 1 year ago

I used the Proxmox iso download utility which checked the MD5 sum I specified after it downloaded the iso. Then I used scp to copy it to the standalone Ubuntu box.

markshank commented 1 year ago

Just tried this on a Proxmox VM with no serial, "standard VGA" and 4 vCPUs. No trouble found.

Unless the "Illegal byte sequence" below means something:

image

kjarex commented 1 year ago

I am 99% sure, I had the exact same issues SOMETIMES (weirdly, not with every boot nor with any recognisable trigger). It eventually disappeared (after maybe 20 fails) without me having changed anything proactively.

edit: by "the exact same" I mean the initial screenshots and errors listed in the serial console buffer

mszoek commented 1 year ago

I noticed that "illegal byte sequence" on my new install as well. It seems harmless - not sure where it's coming from yet.

drumttocs8 commented 1 year ago

What were your other settings using Proxmox? Mine seems to hang at "launchd 1"...

markshank commented 1 year ago

What were your other settings using Proxmox?

root@epyc3000:\~# qm config 132 balloon: 0 bios: ovmf boot: order=ide2;scsi0 cores: 4 efidisk0: CephPool:vm-132-disk-1,efitype=4m,size=528K ide2: cephfs:iso/ravynOS_0.4.0pre5_f14_5654957775912960_amd64.iso,media=cdrom,size=1457042K machine: q35 memory: 4096 name: airyxlivecd net0: virtio=16:7D:D7:B3:1A:D9,bridge=vmbr0 numa: 0 ostype: other scsi0: CephPool:vm-132-disk-0,size=15G scsihw: virtio-scsi-pci smbios1: uuid=5c7e79be-0cbc-4a9f-8bd0-76af5949322d sockets: 1 vga: std vmgenid: 716daf0f-a9bb-485f-9097-1136b38717b1 root@epyc3000:~#

Mine seems to hang at "launchd 1"

Graphics are not supported on VMs yet. Work in progress on drivers for that. This is what I see:

image

mszoek commented 1 year ago

I chanced into reproducing the 'ldconfig' issue and was able to capture the top of the output. This is it:

kldload: can't load utouch: No such file or directory
/etc/rc: WARNING: Unable to load kernel module utouch
^Blo0: link state changed to UP
ELF ldconfig path: /lib /usr/lib /usr/lib/compat /usr/lib/compat/pkg /usr/lib/perl5/5.32/mach/CORE /usr/lib/qt5 /usr/llvm13/lib Applications Library System Users Volumes bin boot cdrom compat dev etc init.sh lib libexec media mnt private proc raminit rescue root sbin sysroot tmp usr var version . version version version version version version version version version version version version version version version version version version version version version Applications Library System Users Volumes bin boot cdrom compat dev etc init.sh lib libexec media mnt private proc raminit rescue root sbin sysroot tmp usr var version version version version version version Applications Library System Users Volumes bin boot cdrom compat dev etc init.sh lib libexec media mnt private proc raminit rescue root sbin sysroot tmp usr var version // // Applications Library System Users Volumes bin boot cdrom compat dev etc init.sh lib libexec media mnt private proc raminit rescue root sbin sysroot tmp usr var version version version version version version . . . . . . . . .

So the actual behavior we're seeing makes sense - ldconfig is iterating that list of directories. What doesn't make sense is WHY it has that list.

mszoek commented 1 year ago

I think it is caused by the presence of /usr/lib/qt5 in the ldconfig_paths entry in /etc/rc.conf. That has been there for a while so I'm not sure why it suddenly causes problems, but removing it seems to fix this - and it makes sense that reading those shared objects as text files containing paths would blow up.