centerforaisafety / cerberus-cluster

HPC cluster code and configurations for running on OCI
Universal Permissive License v1.0
4 stars 0 forks source link

GPU utilisation report for all users at job completion #275

Open WilliamHodgkins opened 4 months ago

WilliamHodgkins commented 4 months ago

It would be helpful to have a short report (like Nvidia-smi) that shows users the core and memory utilization for their jobs as soon as they're completed. This report should be simple and avoid overloading people with too many figures. The goal is to encourage people to notice when they are under-utilizing resources and prompt them to adjust their behavior