Closed sathnaga closed 7 years ago
After some digging, it looks like one vcpu task has handled a hypervisor page fault while the resize code is in the middle of making all the HPTEs absent. The technique which the resize code uses to exclude vcpus from running (set hpte_setup_done to 0 and send an IPI to all CPUs) doesn't actually work since another vcpu task could be in the host handling a page fault or a hcall at the time the IPI is sent, in which case that vcpu task will just handle the IPI and continue to re-enter the guest.
I'm currently trying to think of a reasonable way to fix this...
------- Comment From bssrikanth@in.ibm.com 2017-11-07 01:47:28 EDT------- Paul Mackerras seem to have patch which will hopefully fix this issue.. saw his comments on slack channel of host-os..
Seeing the issue is fixed with latest devel branch update, 4.14.0-2.rc8.dev.gitcc4bf22.el7.centos.ppc64le. will wait for release branch update for this fix to close the issue.
------- Comment From viparash@in.ibm.com 2017-11-07 04:37:28 EDT------- Bug 160904 has been marked as a duplicate of this bug.
Fixed with latest devel branch update, 4.14.0-2.rc8.dev.gitcc4bf22.el7.centos.ppc64le.
------- Comment From lagarcia@br.ibm.com 2017-11-10 20:55:34 EDT------- Sprtin 3 hostos-release branch is closed for new commits. Targeting this one to sprint 4.
Paul,
Could you please cherry-pick this patch into hostos-release as soon as sprint 4 hostos-release branch gets opened?
Verified in latest hostos release branch 4.14.0-1.rel.git68b4afb.el7.centos.ppc64le
(7/9) guest_sanity.hotplug.memory.qemu.qcow2.virtio_scsi.smp2.virtio_net.HostOS.ppc64le.powerkvm-libvirt.libvirt_mem.positive_test.hot_plug: PASS (88.70 s)
Regards, -Satheesh.