aws / aws-parallelcluster

AWS ParallelCluster is an AWS supported Open Source cluster management tool to deploy and manage HPC clusters in the AWS cloud.
https://github.com/aws/aws-parallelcluster
Apache License 2.0
830 stars 312 forks source link

Running 'sacctmgr' slurm command in headnode throws an error #4159

Closed ipraveen3 closed 2 years ago

ipraveen3 commented 2 years ago

Description: Running sacctmgr slurm commands like sacctmgr list cluster in headnode gives the following error: You are not running a supported accounting_storage plugin Only 'accounting_storage/slurmdbd' is supported.

Screenshot: error_msg

Please let me know if I have to do any setup before I run this command or how I can resolve this. Do I need to setup mysql with the slurm in order to do this? or the Dynamodb can handle it.

How to reproduce:

  1. Create a cluster using the below mentioned configuration file. hello-world.txt VPC uses the 'Head node in a public subnet and compute fleet in a private subnet' configuration pcluster create-cluster --cluster-name hello-world --cluster-configuration hello-world.yaml
  2. Log in to the head-node pcluster ssh --cluster-name hello-world -i /path/to/keyfile.pem
  3. Run the following command sacctmgr

I also tried running /opt/slurm/bin/sacctmgr list cluster which gave the same error err_woith_opt_path

Running other sacctmgr commands like sacctmgr show clusters , sacctmgr add resource .... also gives the same error.

Required Info:

Logs: /var/log/parallelcluster/clustermgtd: clustermgtd.log

/var/log/cloud-init.log: cloud-init.log

/var/log/chef-client.log: chef-client.log

Let me know if you need any other information

enrico-usai commented 2 years ago

Hi @ipraveen3

the Slurm accounting feature is not present on ParallelCluster. This is something "optional" you have to configure manually.

You can follow this Blog Post with detailed instructions.

Let me know if it helps.

github-actions[bot] commented 2 years ago

This issue has been automatically closed because there has been no response to our request for more information from the original author. With only the information that is currently in the issue, we don't have enough information to take action. Please reach out if you have or find the answers we need so that we can investigate further.

ipraveen3 commented 2 years ago

Thanks it helped. I was able to setup and run sacctmgr and other commands.