stackhpc / ansible-slurm-appliance

A Slurm-based HPC workload management environment, driven by Ansible.
43 stars 18 forks source link

WIP: Containerise prometheus #308

Open sjpb opened 1 year ago

sjpb commented 1 year ago

WIP, with problems - see https://wiki.stackhpc.com/doc/containerised-prometheus-oLfxe5Es6K for notes

TODO:

sjpb commented 9 months ago

Tested an update (via branch switch and running site.yml again) from bfa719f works, i.e. monitoring from before update is visible afterwards.

sjpb commented 9 months ago

Fat image build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/7411834185 - openhpc-240104-1602-262d12b5

sjpb commented 9 months ago

New image build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/7423991616

sjpb commented 8 months ago

Checked at 3cff73b that upgrading an azimuth-deployed slurm to this works on azimuth from azimuth-config d2e6ee2 / v0.3.2: