Added a SecurityGroup for Lustre with inbound ports 988 and 1018-1023.
Attached the Lustre SecurityGroup to the cluster through AdditionalSecurityGroups.
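For reference, the inbound rules described above could be expressed roughly as the AWS CLI calls below. This is only a sketch: the function name `print_lustre_ingress_commands` and the security group ID passed to it are hypothetical, and the choice of TCP and of a self-referencing source group are assumptions rather than something stated in this report. The function only prints the commands so they can be reviewed before anything is changed.

```shell
# Print (do not run) the AWS CLI calls that would open the Lustre ports
# mentioned above: 988 and 1018-1023.
# Assumptions: protocol is TCP, and the rule allows traffic from members
# of the same security group. The group ID argument is a placeholder.
print_lustre_ingress_commands() {
    sg_id="$1"
    for port in 988 1018-1023; do
        echo "aws ec2 authorize-security-group-ingress --group-id ${sg_id} --protocol tcp --port ${port} --source-group ${sg_id}"
    done
}
```

Calling `print_lustre_ingress_commands sg-0123456789abcdef0` (a made-up ID) prints two commands, one per port entry, which can then be compared against the rules that actually exist on the group.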
Question:
I created the base AMI with pcluster build-image, but I needed to install some packages on it, so I ran yum update or similar. Could that be the cause of this error?
Running handlers complete
[2024-07-16T13:55:25+09:00] ERROR: Exception handlers complete
Infra Phase failed. 47 resources updated in 01 minutes 16 seconds
[2024-07-16T13:55:25+09:00] FATAL: Stacktrace dumped to /etc/chef/local-mode-cache/cache/cinc-stacktrace.out
[2024-07-16T13:55:25+09:00] FATAL: ---------------------------------------------------------------------------------------
[2024-07-16T13:55:25+09:00] FATAL: PLEASE PROVIDE THE CONTENTS OF THE stacktrace.out FILE (above) IF YOU FILE A BUG REPORT
[2024-07-16T13:55:25+09:00] FATAL: ---------------------------------------------------------------------------------------
[2024-07-16T13:55:25+09:00] FATAL: Mixlib::ShellOut::ShellCommandFailed: lustre[mount fsx] (aws-parallelcluster-environment::fsx line 33) had an error: Mixlib::ShellOut::ShellCommandFailed: mount[/scratch] (aws-parallelcluster-environment::fsx line 33) had an error: Mixlib::ShellOut::ShellCommandFailed: Expected process to exit with [0], but received '19'
---- Begin output of ["mount", "-t", "lustre", "-o", "defaults,_netdev,flock,user_xattr,noatime,noauto,x-systemd.automount", "fs-****.fsx.ap-northeast-1.amazonaws.com@tcp:/1234567", "/scratch"] ----
STDOUT:
STDERR: mount.lustre: mount fs-****.fsx.ap-northeast-1.amazonaws.com@tcp:/1234567 at /scratch failed: No such device
Are the lustre modules loaded?
Check /etc/modprobe.conf and /proc/filesystems
---- End output of ["mount", "-t", "lustre", "-o", "defaults,_netdev,flock,user_xattr,noatime,noauto,x-systemd.automount", "fs-****.fsx.ap-northeast-1.amazonaws.com@tcp:/1234567", "/scratch"] ----
Ran ["mount", "-t", "lustre", "-o", "defaults,_netdev,flock,user_xattr,noatime,noauto,x-systemd.automount", "fs-****.fsx.ap-northeast-1.amazonaws.com@tcp:/1234567", "/scratch"] returned 19
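The mount error above suggests checking /proc/filesystems, so a small check along these lines could be run on the failing node. It is a sketch, not a confirmed diagnosis: the function names are hypothetical, and the optional file argument exists only so the check can be tried against a sample file. The kernel version is printed because it may be worth comparing with the kernel the AMI was built on, in case a package update changed it.

```shell
# Return 0 if "lustre" appears as a registered filesystem type.
# The optional argument defaults to /proc/filesystems.
lustre_fs_registered() {
    fs_file="${1:-/proc/filesystems}"
    grep -qw 'lustre' "$fs_file"
}

# Print the running kernel version and whether lustre is registered.
report_lustre_state() {
    echo "running kernel: $(uname -r)"
    if lustre_fs_registered "$1"; then
        echo "lustre: registered"
    else
        echo "lustre: not registered"
    fi
}
```

Running `report_lustre_state` with no argument reads /proc/filesystems on the node. A "not registered" result would be consistent with the "No such device" message in the log, though it does not by itself say why the module is missing.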
Required Info:
AWS ParallelCluster version [e.g. 3.1.1]: 3.8
Full cluster configuration without any credentials or personal data.
During pcluster create, I'm receiving the error shown above.
Output of pcluster describe-cluster command.