okd-project / okd

The self-managing, auto-upgrading, Kubernetes distribution for everyone
https://okd.io
Apache License 2.0
1.76k stars 297 forks source link

Missing libnettle.so.8 shared library with agent-based install OKD 4.12 #1536

Closed linuxeroagrio closed 2 months ago

linuxeroagrio commented 1 year ago

Describe the bug

In preparing-for-installation phase for agent-based installation with a single node cluster, the host status changed to insufficent in the NTP validation step. The logs show that was a missed shared library.

Version

4.12.0-0.okd-2023-03-05-022504

How reproducible

100%

Log bundle

Sample for assisted-service.service log

[core@master-00 ~]$ journalctl -u assisted-service.service

Mar 15 18:48:46 master-00.sno-okd.linuxero-agrio.io service[2200]: time="2023-03-15T18:48:46Z" level=info msg="GetNextSteps infra_env: <75469dda-8e1b-4176-87ec-4ccf718b885f>, host: <8d40f50d-f63e-404d-9fe1-d62cdf54d5c4>, host status: <insufficient>" func="github.com/openshift/assisted-service/internal/host/hostcommands.(*InstructionManager).GetNextSteps" file="/src/internal/host/hostcommands/instruction_manager.go:158" go-id=1190 host_id=8d40f50d-f63e-404d-9fe1-d62cdf54d5c4 infra_env_id=75469dda-8e1b-4176-87ec-4ccf718b885f pkg=instructions request_id=8fd71058-60c4-4478-ac8e-6699af7c707d

Mar 15 18:48:46 master-00.sno-okd.linuxero-agrio.io service[2200]: time="2023-03-15T18:48:46Z" level=error msg="Received step reply <ntp-synchronizer-5c0a809f> from infra-env <75469dda-8e1b-4176-87ec-4ccf718b885f> host <8d40f50d-f63e-404d-9fe1-d62cdf54d5c4> exit-code <-1> stderr <chronyc exited with non-zero exit code 127: \nchronyc: error while loading shared libraries: libnettle.so.8: cannot open shared object file: No such file or directory\n> stdout <>" func=github.com/openshift/assisted-service/internal/bminventory.logReplyReceived file="/src/internal/bminventory/inventory.go:3287" go-id=1190 host_id=8d40f50d-f63e-404d-9fe1-d62cdf54d5c4 infra_env_id=75469dda-8e1b-4176-87ec-4ccf718b885f pkg=Inventory request_id=88ac0e76-38ac-45bd-af41-1f7863ca247a

Sample for agent wait-for boostrap-complete output:

[jorge@centos-stream-virtual02 sno-okd-agent-based-installer]$ openshift-install agent wait-for bootstrap-complete --dir=sno-okd --log-level=debug
DEBUG asset directory: sno-okd                     
DEBUG Loading Agent Config...                      
DEBUG Using Agent Config loaded from state file    
DEBUG Loading Agent Manifests...                   
DEBUG   Loading Agent PullSecret...                
DEBUG     Loading Install Config...                
DEBUG     Using Install Config loaded from state file 
DEBUG   Using Agent PullSecret loaded from state file 
DEBUG   Loading InfraEnv Config...                 
DEBUG     Loading Install Config...                
DEBUG     Loading Agent Config...                  
DEBUG   Using InfraEnv Config loaded from state file 
DEBUG   Loading NMState Config...                  
DEBUG     Loading Agent Config...                  
DEBUG     Loading Install Config...                
DEBUG   Using NMState Config loaded from state file 
DEBUG   Loading AgentClusterInstall Config...      
DEBUG     Loading Install Config...                
DEBUG   Using AgentClusterInstall Config loaded from state file 
DEBUG   Loading ClusterDeployment Config...        
DEBUG     Loading Install Config...                
DEBUG   Using ClusterDeployment Config loaded from state file 
DEBUG   Loading ClusterImageSet Config...          
DEBUG     Loading Release Image Pull Spec...       
DEBUG     Using Release Image Pull Spec loaded from state file 
DEBUG     Loading Install Config...                
DEBUG   Using ClusterImageSet Config loaded from state file 
DEBUG Using Agent Manifests loaded from state file 
DEBUG RendezvousIP from the AgentConfig 192.168.122.104 
DEBUG Agent Rest API never initialized. Bootstrap Kube API never initialized 
INFO Waiting for cluster install to initialize. Sleeping for 30 seconds 
DEBUG Agent Rest API never initialized. Bootstrap Kube API never initialized 
INFO Waiting for cluster install to initialize. Sleeping for 30 seconds 
DEBUG Agent Rest API never initialized. Bootstrap Kube API never initialized 
INFO Waiting for cluster install to initialize. Sleeping for 30 seconds 
DEBUG Agent Rest API never initialized. Bootstrap Kube API never initialized 
INFO Waiting for cluster install to initialize. Sleeping for 30 seconds 
DEBUG Agent Rest API Initialized                   
INFO Cluster is not ready for install. Check validations 
DEBUG Cluster validation: cnv is disabled          
DEBUG Cluster validation: lso is disabled          
DEBUG Cluster validation: lvm is disabled          
DEBUG Cluster validation: odf is disabled          
DEBUG Cluster validation: The pull secret is set.  
WARNING Cluster validation: The cluster has hosts that are not ready to install. 
DEBUG Cluster validation: The cluster has a sufficient number of master candidates. 
DEBUG Cluster validation: The API virtual IP is not required: User Managed Networking 
DEBUG Cluster validation: The API virtual IP is not required: User Managed Networking 
DEBUG Cluster validation: The Cluster Network CIDR is defined. 
DEBUG Cluster validation: The base domain is defined. 
DEBUG Cluster validation: The Ingress virtual IP is not required: User Managed Networking 
DEBUG Cluster validation: The Ingress virtual IP is not required: User Managed Networking 
DEBUG Cluster validation: The Machine Network CIDR is defined. 
DEBUG Cluster validation: The Cluster Machine CIDR is not required: User Managed Networking 
DEBUG Cluster validation: The Cluster Network prefix is valid. 
DEBUG Cluster validation: The cluster has a valid network type 
DEBUG Cluster validation: Same address families for all networks. 
DEBUG Cluster validation: No CIDRS are overlapping. 
DEBUG Cluster validation: No ntp problems found    
DEBUG Cluster validation: The Service Network CIDR is defined. 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Valid inventory exists for the host 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Sufficient CPU cores 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Sufficient minimum RAM 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Sufficient disk capacity 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Sufficient CPU cores for role master 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Sufficient RAM for role master 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Hostname master-00.sno-okd.linuxero-agrio.io is unique in cluster 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Hostname master-00.sno-okd.linuxero-agrio.io is allowed 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Speed of installation disk has not yet been measured 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host is compatible with cluster platform none 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: VSphere disk.EnableUUID is enabled for this virtual machine 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host agent compatibility checking is disabled 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: No request to skip formatting of the installation disk 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: All disks that have skipped formatting are present in the host inventory 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host is connected 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Media device is connected 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: No Machine Network CIDR needed: User Managed Networking 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host belongs to all machine network CIDRs 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host has connectivity to the majority of hosts in the cluster 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Platform Standard PC (Q35 + ICH9, 2009) is allowed 
WARNING Host master-00.sno-okd.linuxero-agrio.io validation: Host couldn't synchronize with any NTP server 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host clock is synchronized with service 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: All required container images were either pulled successfully or no attempt was made to pull them 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Network latency requirement has been satisfied. 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Packet loss requirement has been satisfied. 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host has been configured with at least one default route. 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Domain name resolution for the api.sno-okd.linuxero-agrio.io domain was successful or not required 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Domain name resolution for the api-int.sno-okd.linuxero-agrio.io domain was successful or not required 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Domain name resolution for the *.apps.sno-okd.linuxero-agrio.io domain was successful or not required 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: Host subnets are not overlapping 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: cnv is disabled 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: lso is disabled 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: lvm is disabled 
DEBUG Host master-00.sno-okd.linuxero-agrio.io validation: odf is disabled 
WARNING Host master-00.sno-okd.linuxero-agrio.io: updated status from discovering to insufficient (Host cannot be installed due to following failing validation(s): Host couldn't synchronize with any NTP server) 
INFO Host master-00.sno-okd.linuxero-agrio.io: updated status from insufficient to known (Host is ready to be installed) 
INFO Cluster is ready for install                 
INFO Cluster validation: All hosts in the cluster are ready to install. 
INFO Preparing cluster for installation           
INFO Host master-00.sno-okd.linuxero-agrio.io: updated status from known to preparing-for-installation (Host finished successfully to prepare for installation) 
WARNING Host master-00.sno-okd.linuxero-agrio.io: updated status from preparing-for-installation to insufficient (Host cannot be installed due to following failing validation(s): Host couldn't synchronize with any NTP server) 
INFO Cluster is not ready for install. Check validations 
WARNING Cluster validation: The cluster has hosts that are not ready to install. 
INFO Host master-00.sno-okd.linuxero-agrio.io: New image status quay.io/openshift/okd-content@sha256:bfe30b312af0f8b60b1b33d485ea5d52b8c4353c393e579eb67cf1bb2057c62e. result: success. time: 7.96 seconds; size: 399.08 Megabytes; download rate: 52.54 MBps
JaimeMagiera commented 2 months ago

Hi,

We are not working on FCOS builds of OKD any more. Please see these documents...

https://okd.io/blog/2024/06/01/okd-future-statement https://okd.io/blog/2024/07/30/okd-pre-release-testing

Please test with the OKD SCOS nightlies and file a new issue as needed.

Many thanks,

Jaime