stormshift / support

This repo should serve as a central source for reporting issues with stormshift
GNU General Public License v3.0
3 stars 0 forks source link

Reinstall OCPx / OCP2 #70

Closed mschindl closed 2 years ago

mschindl commented 2 years ago

Redeployment of OCP2 failed:

TASK [Create cluster (Coffee time - watch /root/ocp2_install/.openshift_install.log)] ***********************************************************************************************
fatal: [localhost]: FAILED! => {"changed": true, "cmd": ["/root/ocp49binaries/openshift-install", "create", "cluster", "--dir=/root/ocp2_install"], "delta": "0:02:01.614524", "end": "2022-03-04 12:18:06.872306", "msg": "non-zero return code", "rc": 1, "start": "2022-03-04 12:16:05.257782", "stderr": "level=info msg=Consuming Install Config from target directory\nlevel=info msg=Obtaining RHCOS image file from 'https://rhcos-redirector.apps.art.xq1c.p1.openshiftapps.com/art/storage/releases/rhcos-4.9/49.84.202110081407-0/x86_64/rhcos-49.84.202110081407-0-openstack.x86_64.qcow2.gz?sha256=bbbb9243f084fc330a2c95e0bf33708d68e17628f48086eac574dcb96d35df9e'\nlevel=info msg=The file was found in cache: /root/.cache/openshift-installer/image_cache/rhcos-49.84.202110081407-0-openstack.x86_64.qcow2. Reusing...\nlevel=info msg=Creating infrastructure resources...\nlevel=error\nlevel=error msg=Error: Fault reason is \"Operation Failed\". Fault detail is \"[Cannot add VM. Illegal number of monitors is provided, max allowed number of monitors is 1 for VNC and the max number in the ValidNumOfMonitors configuration variable for SPICE.]\". HTTP response code is \"400\". HTTP response message is \"400 Bad Request\".\nlevel=error\nlevel=error msg=  on ../../tmp/openshift-install--796536323/template/main.tf line 46, in resource \"ovirt_vm\" \"tmp_import_vm\":\nlevel=error msg=  46: resource \"ovirt_vm\" \"tmp_import_vm\" {\nlevel=error\nlevel=error\nlevel=fatal msg=failed to fetch Cluster: failed to generate asset \"Cluster\": failed to create cluster: failed to apply Terraform: failed to complete the change", "stderr_lines": ["level=info msg=Consuming Install Config from target directory", "level=info msg=Obtaining RHCOS image file from 'https://rhcos-redirector.apps.art.xq1c.p1.openshiftapps.com/art/storage/releases/rhcos-4.9/49.84.202110081407-0/x86_64/rhcos-49.84.202110081407-0-openstack.x86_64.qcow2.gz?sha256=bbbb9243f084fc330a2c95e0bf33708d68e17628f48086eac574dcb96d35df9e'", "level=info msg=The file was found in cache: /root/.cache/openshift-installer/image_cache/rhcos-49.84.202110081407-0-openstack.x86_64.qcow2. Reusing...", "level=info msg=Creating infrastructure resources...", "level=error", "level=error msg=Error: Fault reason is \"Operation Failed\". Fault detail is \"[Cannot add VM. Illegal number of monitors is provided, max allowed number of monitors is 1 for VNC and the max number in the ValidNumOfMonitors configuration variable for SPICE.]\". HTTP response code is \"400\". HTTP response message is \"400 Bad Request\".", "level=error", "level=error msg=  on ../../tmp/openshift-install--796536323/template/main.tf line 46, in resource \"ovirt_vm\" \"tmp_import_vm\":", "level=error msg=  46: resource \"ovirt_vm\" \"tmp_import_vm\" {", "level=error", "level=error", "level=fatal msg=failed to fetch Cluster: failed to generate asset \"Cluster\": failed to create cluster: failed to apply Terraform: failed to complete the change"], "stdout": "", "stdout_lines": []}

/root/ocp2_install/.openshift_install.log

time="2022-03-04T12:18:06+01:00" level=debug msg="module.template.ovirt_image_transfer.releaseimage[0]: Creation complete after 1m49s [id=d3c6b100-8777-462b-b6cc-2335bbf9587d]"
time="2022-03-04T12:18:06+01:00" level=debug msg="module.template.ovirt_vm.tmp_import_vm[0]: Creating..."
time="2022-03-04T12:18:06+01:00" level=error
time="2022-03-04T12:18:06+01:00" level=error msg="Error: Fault reason is \"Operation Failed\". Fault detail is \"[Cannot add VM. Illegal number of monitors is provided, max allowed number of monitors is 1 for VNC and the max number in the ValidNumOfMonitors configuration variable for SPICE.]\". HTTP response code is \"400\". HTTP response message is \"400 Bad Request\"."
time="2022-03-04T12:18:06+01:00" level=error
time="2022-03-04T12:18:06+01:00" level=error msg="  on ../../tmp/openshift-install--796536323/template/main.tf line 46, in resource \"ovirt_vm\" \"tmp_import_vm\":"
time="2022-03-04T12:18:06+01:00" level=error msg="  46: resource \"ovirt_vm\" \"tmp_import_vm\" {"
time="2022-03-04T12:18:06+01:00" level=error
time="2022-03-04T12:18:06+01:00" level=error
time="2022-03-04T12:18:06+01:00" level=fatal msg="failed to fetch Cluster: failed to generate asset \"Cluster\": failed to create cluster: failed to apply Terraform: failed to complete the change"

Cleanup with

[root@stormshiftdeploy ~]# ./ocp49binaries/openshift-install --dir /root/ocp2_install/ destroy cluster
INFO Removing AffinityGroup ocp2-qgfxm-controlplane 
INFO Removing AffinityGroup ocp2-qgfxm-compute    
INFO Time elapsed: 0s  
DanielFroehlich commented 2 years ago

There might be still an issue with RHV Cluster Compatibility Versions. Investigating....

DanielFroehlich commented 2 years ago

It actually was a problem with the default template "Blank" , which had a config that was no longer valid with the new RHV Version. It was with the Console Settings - I switched it to "Headless", and then cluster deployment / vm creating works.

image