IBM / cloud-pak-deployer

Configuration-based installation of OpenShift and Cloud Pak for Data/Integration/Watson AIOps on various private and public cloud infrastructure providers. Deployment attempts to achieve the end-state defined in the configuration. If something fails along the way, you only need to restart the process to continue the deployment.
https://ibm.github.io/cloud-pak-deployer/
Apache License 2.0
140 stars 68 forks source link

Openshift 4.16 IPI on AWS fails due to incorrect Security Group Names #802

Open Alan111S opened 3 weeks ago

Alan111S commented 3 weeks ago

Trying to install OpenShift 4.16 on AWS and this error is being triggered:- automation-roles/30-provision-infra/provision-aws/tasks/aws-enable-openshift-fileserver.yml line 35 msg: "Security group with name {{ _current_openshift_cluster.name }}*-worker-sg not found"

<cluster-id>-node
<cluster-id>-lb
<cluster-id>-control-plane
<cluster-id>-apiserver-lb

These sg's are attached to the worker nodes:-

<cluster-id>-node
<cluster-id>-lb

cloud-pak-deployer.log cloud-pak-deployer-log-1728016934.tar.gz staging1-delete-cluster.log

fketelaars commented 3 weeks ago

@Alan111S you can check the fix in branch aws-changes

Alan111S commented 1 week ago

@fketelaars There seems to be a problem with this change on AWS. I saw this when creating a 4.14 cluster. Looks like a float vs string comparison issue:-

TASK [provision-aws : Set name of compute node security group for OpenShift < 4.16] ***
Tuesday 22 October 2024  07:05:21 +0000 (0:00:00.071)       0:03:18.657 *******
fatal: [localhost]: FAILED! => {"msg": "The conditional check '_current_openshift_cluster.ocp_version < '4.16'' failed. The error was: Unexpected templating type error occurred on ({% if _current_openshift_cluster.ocp_version < '4.16' %} True {% else %} False {% endif %}): '<' not supported between instances of 'float' and 'str'\n\nThe error appears to be in '/cloud-pak-deployer/automation-roles/30-provision-infra/provision-aws/tasks/aws-enable-openshift-fileserver.yml': line 18, column 3, but may\nbe elsewhere in the file depending on the exact syntax problem.\n\nThe offending line appears to be:\n\n\n- name: Set name of compute node security group for OpenShift < 4.16\n  ^ here\n"}
fketelaars commented 2 days ago

@Alan111S , can you please put the ocp_version attribute between single or double quotes?