Open marcodelapierre opened 1 year ago
Note I was doing my tests based on your blog post at https://hpckp.org/articles/how-to-use-the-slurm-simulator-as-a-development-and-testing-environment/
Today I have found a couple of commands to run after the very first login into the running container with the Slurm simulator (they make sense -- daemon services need to be started):
systemctl start slurmctld
systemctl start slurmd
This page gave me the hint: https://drtailor.medium.com/how-to-setup-slurm-on-ubuntu-20-04-for-single-node-work-scheduling-6cc909574365
It would probably be good if this could be double-checked and added to your original blog post, to enable out-of-the-box tests.
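For anyone reproducing this, the two commands above can be wrapped in a small helper that starts each daemon only if it is not already active. This is a minimal sketch, not part of the simulator images: the function names and the SYSTEMCTL override variable are hypothetical, and it assumes systemd is running as the init process inside the container.

```shell
#!/bin/sh
# Sketch: start the Slurm daemons only when they are not already active.
# SYSTEMCTL is a hypothetical override so the logic can be tried without systemd.
SYSTEMCTL="${SYSTEMCTL:-systemctl}"

# $1: systemd unit name, e.g. slurmctld
ensure_started() {
    if "$SYSTEMCTL" is-active --quiet "$1"; then
        echo "$1: already active"
    else
        echo "$1: starting"
        "$SYSTEMCTL" start "$1"
    fi
}

# Controller first, then the compute daemon, as in the commands above.
ensure_slurm_services() {
    ensure_started slurmctld && ensure_started slurmd
}
```

Calling ensure_slurm_services after the first login, and then running sinfo, should show whether the controller responds.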
Hi @marcodelapierre ,
Thank you for bringing this to my attention. The images are designed to start the required services via systemd. If they are not starting, it could be for one of the following reasons:
Can you describe the working environment (OS distro+version, Docker version, Docker from official repo or from Linux distribution)? Have you followed the instructions provided in the official Docker documentation? https://docs.docker.com/engine/install/
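The details asked for above can be collected with a few commands. The following is a rough sketch for a Debian/Ubuntu host; the function names are hypothetical. It relies on docker-ce being the package name from Docker's own repository and docker.io being the Ubuntu/Debian package.

```shell
#!/bin/sh
# Print the distribution name and version from an os-release file.
# $1: path to the file (defaults to /etc/os-release).
os_pretty_name() {
    ( . "${1:-/etc/os-release}" && echo "$PRETTY_NAME" )
}

# Summarise the OS, the Docker version, and which Docker package is installed.
report_environment() {
    echo "OS: $(os_pretty_name)"
    echo "Docker: $(docker --version 2>/dev/null || echo 'not found')"
    # docker-ce: Docker's official repository; docker.io: distribution package.
    if command -v dpkg-query >/dev/null 2>&1; then
        dpkg-query -W -f='${Package} ${Version}\n' docker-ce docker.io 2>/dev/null
    fi
    return 0
}
```

Running report_environment on the host and pasting the output here would cover the OS, Docker version, and installation source in one go.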
BTW, the simulator dockerfiles used for creating the images are in our private Git repository.
Cheers,
Jordi
Thanks for getting back to me on this, Jordi. It is always a pleasure to chat with you (we met a couple of times in Perth, at an HPC/AI conference and at your Kubernetes training at Pawsey in 2020).
I am running these tests on an Ubuntu 22.04 virtual machine on our on-prem OpenStack infrastructure.
The Docker version is 24.0.2, build cb74dfc. Not sure how it was installed (it is part of our pre-canned image), but I can check with the team.
What do you reckon? Thank you
Hi team, @jordiblasco,
thanks for this utility, it looks very promising!
I was reading through your blog post, to try and spawn a vanilla instance of the simulator on a Linux VM I use. I followed the prompts in the blog:
and tested it with three different Slurm versions:
The last command in the snippet above, sinfo, always gives me an error:
What am I doing wrong? Could you provide some concise guidance to get a minimal working setup?
Thank you in advance, Marco