OleHolmNielsen / Slurm_tools

My tools for the Slurm HPC workload manager
GNU General Public License v3.0

awk: cmd. line:261: (FILENAME=- FNR=1) fatal: division by zero attempted #28

Open tiansira opened 1 month ago

tiansira commented 1 month ago

Hi, when I run this script I get the error output below. Could you please have a look and let me know how to fix it? Thank you.


Notice: Longest hostname length is truncated to 20
Hostname Partition Node Num_CPU CPUload Memsize Freemem Joblist
State Use/Tot (15min) (MB) (MB) JobID User ...
awk: cmd. line:261: (FILENAME=- FNR=1) fatal: division by zero attempted

OleHolmNielsen commented 4 weeks ago

Which script are you trying to run?
Please specify also your Slurm version. Thanks.

tiansira commented 3 weeks ago

I ran the pestat script, and my Slurm version is 20.11.9. Thanks.

OleHolmNielsen commented 3 weeks ago

The pestat script should also work with old Slurm versions such as 20.11 (upgrading Slurm is recommended, however). The message "Notice: Longest hostname length is truncated to" is printed by the pestat script and is probably due to very long hostnames on your nodes. I think it is a good idea to use short hostnames without the DNS domain. Can you please show me your hostnames as printed by this command:

$ sinfo -N -O NodeList

Then I will try to reproduce the error that you see.
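
If the node list is long, a summary may be easier to post than the full output. The following is only a rough sketch and not part of pestat: the function name longest_short_hostname is hypothetical, and it assumes one hostname per line on standard input. It strips any DNS domain and reports the length and name of the longest short hostname.

```shell
# Hypothetical helper, not part of pestat: reads one hostname per line on
# stdin, strips any DNS domain, and prints "<length> <name>" for the
# longest short hostname seen.
longest_short_hostname() {
    awk '
        NF == 0 { next }                      # skip blank lines
        {
            name = $1                         # first field, ignores padding
            sub(/\..*$/, "", name)            # drop everything after the first dot
            if (length(name) > max) {
                max = length(name)
                longest = name
            }
        }
        END { printf "%d %s\n", max + 0, longest }
    '
}

# On a cluster (assumes sinfo is in PATH; -h suppresses the header line):
#   sinfo -N -h -O NodeList | sort -u | longest_short_hostname
```

A length well above 20 in that output would be consistent with the truncation notice, although it would not by itself explain the division by zero.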