Open ydirson opened 1 month ago
SMGC claims to have finished at 13:11:27, so I'm not sure what it was doing until the timeout at 13:13:
Oct 1 13:11:27 host1 systemd[1]: Started Garbage Collector for SR e6e40ee6-0491-0c3f-186c-db3d00a623a7.
Oct 1 13:13:14 host1 emu-manager-4[16196]: Failed to read from xenopsd because timeout reached.
Do you have more logs for that period? The SM log seems to be truncated, or was there really no more activity there?
Hm, I cannot rule out a copy-paste mistake. I will check whether I still have the full log files; otherwise I will upload the next occurrence (I already saw this 3 times in 2 days, so I'm pretty confident I can reproduce it).
During a vm-checkpoint on XCP-ng 8.3 (so using xcp-emu-manager), I got a case of xe vm-checkpoint never returning. According to the logs, xenopsd became non-responsive, but we fail to see why. The log shows an SR GC between the checkpoint start and its failure, featuring errors of its own, involving the VDI holding the VM we're attempting to checkpoint.
The problem seems to be manifold:
- "Failed to read from xenopsd because timeout reached." is reported by emu-manager, but xenopsd does not show anything
- emu-manager process it called has indeed finished with an error, and keeping the task pending
xsensource.log
daemon.log
SMlog