minershive / hiveos-pxe-diskless

Network boot for diskless rigs
38 stars 26 forks source link

8-16 - fresh download doesn't build pxe botable image #29

Open bmartino1 opened 2 months ago

bmartino1 commented 2 months ago

minershive/hiveos-pxe-diskless

Following guide: https://hiveon.com/forum/t/hive-os-diskless-pxe/12319/2

I kept defaults.

I run the script and command per readme... /deploy... ubuntu20 --build command completed up until error:

[16/08/2024 18:09:00][DEPLOY_PXE] Saving to build/ubuntu20/ubuntu20.tar.xz ...../deploy_pxe: line 760: 39101 Broken pipe tar -C $DIR --exclude='boot/' --exclude='usr/src/' --exclude='dev/*' -cpf - . 39102 Killed | pixz -9 -e > $filepath [FAIL] root@pxe-diskless:/pxeserver# uname -r 5.4.0-192-generic root@pxe-diskless:/pxeserver# lsb_release -l Usage: lsb_release [options]

lsb_release: error: no such option: -l root@pxe-diskless:/pxeserver# lsb_release -a No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 20.04.6 LTS Release: 20.04 Codename: focal root@pxe-diskless:/pxeserver#

image

Can't build Ubuntu image to run on pxe server...

no other info on where to store other images and how to boot to test... --Netboot directory for x86_64-efi created. Configure your DHCP server to point to /pxeserver/tftp/efi/x86_64-efi/core.efi

tftp never loads pxe file... lookt into hive clone deploy ipxe boot files for better deploy system...

bmartino1 commented 2 months ago

also nvidia build broken as well...

root@pxe-diskless:/pxeserver# ./deploy_pxe ubuntu --build Usage: deploy_pxe ubuntu20 --build create latest Ubuntu 20.04 image deploy_pxe ubuntu20 --selfupgrade just upgrade Hive package and repack roo tfs image deploy_pxe ubuntu20 --upgrade upgrade all and repack rootfs image deploy_pxe ubuntu20 --chroot chroot into rootfs (for manual actions) deploy_pxe ubuntu20 --initrd rebuild initramfs image deploy_pxe ubuntu20 --remove delete rootfs folder

Nvidia drivers: deploy_pxe nvidia --list list available driver versions deploy_pxe nvidia --build build driver specific version ( e.g. 515 or 515.105 or 515.105.01)

AMD OpenCL: deploy_pxe opencl --list list available driver versions deploy_pxe opencl --build build specific version (for now 5.4 only

nvidia list works, but can't build nvidia script never completes nor runs.

may be time for a refresh...

bmartino1 commented 2 months ago

editing the deploy script and changing the piz compression form -9 to -5 allowed the script top finish and make files. https://github.com/minershive/hiveos-pxe-diskless/blob/6de7c608abc22d2a9026a28755b4dbfcbc8c677e/pxeserver/deploy_pxe#L635

Pxe boot to hive diskless and now error with boot init issues...

code change: deploy_fs() { local dir_size=du -hsm $DIR | awk '{print $1}' | tr -d '\r\n' local filepath=build/$dist/$dist.tar.xz echo2 "Directory size: ${dir_size}M" && echo_ok

    echo2 "Saving to $filepath ..."
    tar -C $DIR --exclude='boot/' --exclude='usr/src/' --exclude='dev/*' -                                                                                                                                                             cpf - . | pixz -5 -e > $filepath
    [[ $? -ne 0 ]] && echo_fail  echo_ok
        echo2 "Create symlink ..."
        rm -rf hiveramfs/$dist.tar.xz && ln -s ../$filepath hiveramfs/$dist.                                                                                                                                                             tar.xz
    [[ $? -ne 0 ]] && echo_fail  echo_ok

}

echo "" echo ""

then script apears to have finished with out issue. -9 is a depper compression

Not sure if this compression is breaking diskless boot though...

bmartino1 commented 2 months ago

regardless, a fresh download following the guide doesn't work...

image

villos commented 1 month ago

Hi, bmartino1 About the first screenshot: Please send pxe server OS config (uname -a, CPU, amount of RAM memory) In some cases compess with -9 option crashed when amount of RAM is small

About the last screenshot: Please send rig config: (motherboard model, ethernet adapter model) Judging by the screenshot, linux kernel on initramfs not found ethernet adapter(possibly missing module in intramfs?) and so DHCP client doesn't start propertly. BR