OleHolmNielsen / Slurm_tools

My tools for the Slurm HPC workload manager
GNU General Public License v3.0
444 stars 96 forks source link

awk: cmd. line:261: (FILENAME=- FNR=1) fatal: division by zero attempted #28

Open tiansira opened 3 months ago

tiansira commented 3 months ago

Hi when I run this script, I got the below error output, Could you pls have a look, and let me know how to fix it? Thank you.


Notice: Longest hostname length is truncated to 20 Hostname Partition Node Num_CPU CPUload Memsize Freemem Joblist State Use/Tot (15min) (MB) (MB) JobID User ... awk: cmd. line:261: (FILENAME=- FNR=1) fatal: division by zero attempted

OleHolmNielsen commented 3 months ago

Which script are you trying to run?
Please specify also your Slurm version. Thanks.

tiansira commented 2 months ago

I run the pestat script, and my slurm version is slurm 20.11.9. Thanks.

OleHolmNielsen commented 2 months ago

The pestat should work also with old Slurm versions such as 20.11 (it is recommended to upgrade Slurm, however). The warning "Notice: Longest hostname length is truncated to" can be seen in the pestat script and is probably due to very long hostnames of your nodes. I think it is a good idea to use short hostnames without the DNS domain. Can you please show me your hostnames as printed by this command:

$ sinfo -N -O NodeList

Then I will try to reproduce the error that you see.

OleHolmNielsen commented 1 month ago

Any updates on this issue? Thanks, Ole

OleHolmNielsen commented 1 month ago

Any updates on this issue? Thanks, Ole