OpenNebula / one

The open source Cloud & Edge Computing Platform bringing real freedom to your Enterprise Cloud 🚀
http://opennebula.io
Apache License 2.0

onemonitord in-memory db possible issues with HA #6283

Open tinova opened 1 year ago

tinova commented 1 year ago

Description

In an HA setup, some situations leave certain hosts running duplicated monitord client processes, one of which uses an old host_id that is no longer present in the host_pool database.

To Reproduce

In an HA setup, remove a host from OpenNebula. In some situations this leaves duplicate "monitord" processes running on several hypervisors. There are two processes on each affected hypervisor: one with the current host ID and one with the old host ID. The old host ID indicates that these machines were previously part of the zone but were removed for maintenance.

onemonitord keeps an in-memory DB. It should be updated from oned immediately after startup.

Expected behavior

Old host IDs that have been removed from OpenNebula should no longer be monitored.
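
To make the expected behavior concrete, here is a minimal sketch of how stale monitoring could be detected. It assumes you have already collected, per hypervisor, the host IDs used by the running monitor client processes, plus the set of host IDs currently in host_pool. The function name and the input shapes are hypothetical; this is not OpenNebula code.

```python
from typing import Dict, Iterable, List, Set


def find_stale_monitors(
    observed: Dict[str, Iterable[int]],
    pool_ids: Set[int],
) -> Dict[str, List[int]]:
    """Return, per hypervisor, the observed host IDs missing from the pool.

    observed: hypothetical mapping of hypervisor name -> host IDs used by
              the monitor client processes found running on it.
    pool_ids: host IDs currently present in host_pool.
    """
    stale: Dict[str, List[int]] = {}
    for hypervisor, ids in observed.items():
        # Any ID seen on the hypervisor but absent from the pool is stale.
        missing = sorted(set(ids) - pool_ids)
        if missing:
            stale[hypervisor] = missing
    return stale
```

For example, `find_stale_monitors({"kvm1": [12, 3]}, {12})` flags ID 3 on `kvm1`, which matches the "current ID plus old ID" pattern described above.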

tinova commented 1 year ago

transferred to OpenNebula/one https://github.com/OpenNebula/one/issues/6283

paczerny commented 1 year ago

Note for later: I wasn't able to reproduce this. It looks like the Host state is not updated in onemonitord on the slave. If the Host is deleted and the leader changes, the non-updated onemonitord keeps starting the monitor agent for the non-existent host.
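
The suspected failure mode can be sketched as a cache that is only refreshed on demand. This is an illustration under stated assumptions, not the actual onemonitord implementation: `InMemoryHostPool`, `fetch_host_ids`, and `on_leader_elected` are hypothetical names, and the real daemon is not written in Python. The idea is that the in-memory host list is reconciled against the authoritative one before any monitor agent is started after a leader change.

```python
from typing import Callable, Iterable, Set


class InMemoryHostPool:
    """Hypothetical stand-in for the in-memory host DB kept by onemonitord."""

    def __init__(self, fetch_host_ids: Callable[[], Iterable[int]]) -> None:
        # fetch_host_ids is assumed to return the authoritative host IDs
        # (for example, as known to oned). It is not a real OpenNebula API.
        self._fetch_host_ids = fetch_host_ids
        self._host_ids: Set[int] = set()

    def reconcile(self) -> Set[int]:
        """Replace the cached IDs with the authoritative ones.

        Returns the stale IDs that were dropped from the cache.
        """
        current = set(self._fetch_host_ids())
        stale = self._host_ids - current
        self._host_ids = current
        return stale

    def on_leader_elected(self) -> Set[int]:
        # Refresh before any agent is started, so hosts deleted while this
        # node was a follower are not monitored again.
        return self.reconcile()

    def hosts_to_monitor(self) -> Set[int]:
        # Return a copy so callers cannot mutate the cache.
        return set(self._host_ids)
```

Without the reconcile step on leader election, `hosts_to_monitor()` would keep returning the deleted host, which is consistent with the behavior described in the note above.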