Closed jcanocan closed 11 months ago
Hey @jcanocan,
it worked but requires a tiny modification, see: https://github.com/Azure/azure-sdk-for-python/issues/30983#issuecomment-1647081509
thank you for pointing this out, really appreciated!
It looks like FAR it's not able to find the fence_azure_arm tool in PATH for its purpose.
I think fence_azure_arm
is not installed in FAR's image. Currently it installs fence-agents-all
(and aws
), but it doesn't seem it includes the azure one
➤ docker run --rm -it quay.io/clobrano/fence-agents-remediation-fencing-agents bash
[root@4f3bb118da07 /]# fence_a
fence_amt_ws fence_apc fence_apc_snmp fence_aws
[root@4f3bb118da07 /]# fence_
fence_amt_ws fence_brocade fence_eaton_snmp fence_hpblade fence_ilo2 fence_ilo5 fence_imm fence_kdump fence_rsb fence_vmware_soap
fence_apc fence_cisco_mds fence_emerson fence_ibmblade fence_ilo3 fence_ilo5_ssh fence_intelmodular fence_mpath fence_sbd fence_wti
fence_apc_snmp fence_cisco_ucs fence_eps fence_idrac fence_ilo3_ssh fence_ilo_moonshot fence_ipdu fence_redfish fence_scsi fence_xvm
fence_aws fence_compute fence_evacuate fence_ifmib fence_ilo4 fence_ilo_mp fence_ipmilan fence_rhevm fence_virt
fence_bladecenter fence_drac5 fence_heuristics_ping fence_ilo fence_ilo4_ssh fence_ilo_ssh fence_ipmilanplus fence_rsa fence_vmware_rest
[root@4f3bb118da07 /]# fence_
Thanks for answering back! I'm glad to help :blush:
Regarding https://github.com/Azure/azure-sdk-for-python/issues/30983#issuecomment-1647081509. Looks like they are not motivated to make the change. Moreover, It will take some time to land. Therefore, what do you think about including the following command right after fence-azure-arm
package installation?
RUN sed -i 's/\"instanceView\"/expand=\"instanceView\"/' /usr/sbin/fence_azure_arm
I would agree that it's not a very clean solution, just a workaround. Nevertheless, it will allow the fence agent work.
Looks like they are not motivated to make the change.
It seems they need to propagate the request to the right people :)
I would agree that it's not a very clean solution, just a workaround. Nevertheless, it will allow the fence agent work.
We actually want to decouple the operator's image from the one containing the agents so that one could use an image with a specific fencing agent and the related quirks to make it work.
First of all thanks Javier for noticing/raising the notion of using Azure fence agent!
Looks like they are not motivated to make the change.
Yes, how about creating a PR with the above fix to https://github.com/ClusterLabs/fence-agents/tree/main repo? They are available in their mailing list if you want to discuss about if beforehand.
We actually want to decouple the operator's image from the one containing the agents so that one could use an image > with a specific fencing agent and the related quirks to make it work.
Thanks for letting me know. Sounds nice :)
First of all thanks Javier for noticing/raising the notion of using Azure fence agent!
Looks like they are not motivated to make the change.
Yes, how about creating a PR with the above fix to https://github.com/ClusterLabs/fence-agents/tree/main repo? They are available in their mailing list if you want to discuss about if beforehand.
Thanks for the suggestion. I misinterpreted the words in https://github.com/Azure/azure-sdk-for-python/issues/30983#issuecomment-1647081509, but I just realized that the azure fence agent is independent to the https://github.com/Azure/azure-sdk-for-python. Apologizes for the confusion. So I will try to post a PR fixing this issue in the fence agent.
Meanwhile, I will learn how to build the operator locally and deploy it in an OCP cluster.
Posted https://github.com/ClusterLabs/fence-agents/pull/562. Just in case you are curious :)
Hi!
I'm currently playing around with FAR with Azure VMs. I've been able to install NHC, FAR in an OCP 4.13 cluster, to create the FAR Template and start the remediation process. This is the FAR Template I'm currently using:
I've tried with fence_azure_arm tool standalone locally to restart a faulty VM where an OCP node is running. For that purpose, I stopped the kubelet process to bring a node to an unhealthy state, and it worked but requires a tiny modification, see: https://github.com/Azure/azure-sdk-for-python/issues/30983#issuecomment-1647081509
Nevertheless, it is not working along with FAR operator. It throws the following errors:
It looks like FAR it's not able to find the
fence_azure_arm
tool in PATH for its purpose.Environment:
Thanks in advance!