Closed haiderim closed 6 years ago
The plugin is unable to find any Disk Stores... Do you have any configured?
ssh to your esxi box and run the following commands. Update here with the results.
# esxcli storage filesystem list
# df
It's indeed weird, I can't query the datastore using either esxcli or df but can browse it using web view. I'll try another host then and report back.
So I rebooted the host and it worked. Thanks for the pointing it out, I never would've guessed as everything looked ok in webview.
Thanks for confirming your work-around. (The reboot) I've seen this problem a few times now and a reboot has always fixed the problem. Right now I'm guessing there was IO disk errors on one or more of your Disk Stores that's creating this problem, but I'm not 100% sure.
I'm interested to see if a rescan fixes this issue. If you or anyone has a similar issue (one or more Disk Stores are not see by esxcli and/or df command, please run the rescan option and update this thread with the details.
esxcli storage filesystem rescan
I observed the same behavior on one of my ESXi hosts.
To me, it happened randomly, even in a multi-machine setup. It happened, e.g., for 2 out of 5 machines, each with a similar setup. Re-running the deployment a few times then lead to a success. So, the behavior is not reproducible in a deterministic manner.
The error message of the command esxcli storage filesystem list
says
Error on command storage filesystem list. Error was Invalid name specified for MetaStructure: 'FilesystemVolume'
I got that message by intercepting esxcli storage filesystem list
in the createvm.rb
action with a tee
command as follows
# Figure out DataStore
r = ssh.exec!(
'esxcli storage filesystem list | tee "$(mktemp -p /tmp)" | grep "/vmfs/volumes/.*[VMFS|NFS]" | '\
"sort -nk7 | awk '{print $2}'")
I never saw it when directly SSHing to the esxi host and issuing it as it is proposed by @josenk above.
When googling the error message, you find one direct hint on VMWARE (here), which leads to a patch. Meanwhile, I installed all patches since ESXi 5.5u1 release and the host now seems to behave well. I ran 5 deployments of that multi-machine setup with two reboots of the host. The failure did not show-up again. I will keep you updated. Keep your fingers crossed :-)
p.s. I observed this behavior only on one of my ESXi hosts, which runs on ESXi 5.5 on an HP ProLiant DL580 G5. I cannot remember having it seen on the other host, which uses now ESXi 6.7 and an ESXi 6.5.0a before. I can see from the ESXi build number of the case above that is was with an ESXi 6.5 U1g, which is in between my 6.5.0a and the 6.7. So, I am not sure whether the patches and updates really solve the problem.
Great to know a patch could fix the problem...
@josenk esxcli storage filesystem rescan did fix the problem for me. Sorry for the delayed update
vagrant version
lsb_release -a
My Vagrantfile
vagrant up --provider=vmware_esxi
vagrant ssh
vagrant status
I don't know what's wrong here as I can ssh into the host just fine.