EESSI / software-layer

Software layer of the EESSI project
https://eessi.github.io/docs/software_layer
GNU General Public License v2.0
23 stars 46 forks source link

{2023.06,zen4} foss/2023b #566

Closed boegel closed 4 months ago

eessi-bot[bot] commented 4 months ago

Instance eessi-bot-mc-aws is configured to build:

eessi-bot[bot] commented 4 months ago

Instance eessi-bot-mc-azure is configured to build:

boegel commented 4 months ago

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4

eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - no jobs were submitted
eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - submitted job `72`, for details & status see https://github.com/EESSI/software-layer/pull/566#issuecomment-2098419161
eessi-bot[bot] commented 4 months ago
New job on instance eessi-bot-mc-azure for architecture x86_64-amd-zen4 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.05/pr_566/72 date job status comment
May 07 13:31:52 UTC 2024 submitted job id 72 awaits release by job manager
May 07 13:32:12 UTC 2024 released job awaits launch by Slurm scheduler
May 07 13:57:01 UTC 2024 running job 72 is running
May 07 14:04:13 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-72.out
:x: found message matching ERROR:
:x: found message matching FAILED:
:x: found message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
May 07 14:04:13 UTC 2024 test result
:cry: FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
:white_check_mark: job output file slurm-72.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
boegel commented 4 months ago

GCC build failed with g++: fatal error: Killed signal terminated program cc1plus because not enough memory is available, bot configuration needs to be tweaked on build cluster in Azure

boegel commented 4 months ago

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4

eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - no jobs were submitted
eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - submitted job `83`, for details & status see https://github.com/EESSI/software-layer/pull/566#issuecomment-2099263442
eessi-bot[bot] commented 4 months ago
New job on instance eessi-bot-mc-azure for architecture x86_64-amd-zen4 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.05/pr_566/83 date job status comment
May 07 20:34:48 UTC 2024 submitted job id 83 awaits release by job manager
May 07 20:35:52 UTC 2024 released job awaits launch by Slurm scheduler
May 07 22:53:47 UTC 2024 running job 83 is running
May 08 01:15:44 UTC 2024 finished
:grin: SUCCESS (click triangle for details)
Details
:white_check_mark: job output file slurm-83.out
:white_check_mark: no message matching ERROR:
:white_check_mark: no message matching FAILED:
:white_check_mark: no message matching required modules missing:
:white_check_mark: found message(s) matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen4-1715130654.tar.gzsize: 1348 MiB (1414142372 bytes)
entries: 24583
modules under _2023.06/software/linux/x8664/amd/zen4/modules/all
BLIS/0.9.0-GCC-12.3.0.lua
CMake/3.26.3-GCCcore-12.3.0.lua
FFTW.MPI/3.3.10-gompi-2023a.lua
FFTW/3.3.10-GCC-12.3.0.lua
FlexiBLAS/3.3.1-GCC-12.3.0.lua
GCC/12.3.0.lua
GCCcore/12.3.0.lua
OpenBLAS/0.3.23-GCC-12.3.0.lua
OpenMPI/4.1.5-GCC-12.3.0.lua
OpenSSL/1.1.lua
PMIx/4.2.4-GCCcore-12.3.0.lua
Perl/5.36.1-GCCcore-12.3.0.lua
Python/3.11.3-GCCcore-12.3.0.lua
SQLite/3.42.0-GCCcore-12.3.0.lua
ScaLAPACK/2.2.0-gompi-2023a-fb.lua
Tcl/8.6.13-GCCcore-12.3.0.lua
UCC/1.2.0-GCCcore-12.3.0.lua
UCX/1.14.1-GCCcore-12.3.0.lua
UnZip/6.0-GCCcore-12.3.0.lua
cURL/8.0.1-GCCcore-12.3.0.lua
foss/2023a.lua
gompi/2023a.lua
hwloc/2.9.1-GCCcore-12.3.0.lua
libarchive/3.6.2-GCCcore-12.3.0.lua
libevent/2.1.12-GCCcore-12.3.0.lua
libfabric/1.18.0-GCCcore-12.3.0.lua
libffi/3.4.4-GCCcore-12.3.0.lua
libpciaccess/0.17-GCCcore-12.3.0.lua
libxml2/2.11.4-GCCcore-12.3.0.lua
make/4.4.1-GCCcore-12.3.0.lua
numactl/2.0.16-GCCcore-12.3.0.lua
pkgconf/1.8.0.lua
pkgconf/1.9.5-GCCcore-12.3.0.lua
xorg-macros/1.20.0-GCCcore-12.3.0.lua
software under _2023.06/software/linux/x8664/amd/zen4/software
BLIS/0.9.0-GCC-12.3.0
CMake/3.26.3-GCCcore-12.3.0
FFTW.MPI/3.3.10-gompi-2023a
FFTW/3.3.10-GCC-12.3.0
FlexiBLAS/3.3.1-GCC-12.3.0
GCC/12.3.0
GCCcore/12.3.0
OpenBLAS/0.3.23-GCC-12.3.0
OpenMPI/4.1.5-GCC-12.3.0
OpenSSL/1.1
PMIx/4.2.4-GCCcore-12.3.0
Perl/5.36.1-GCCcore-12.3.0
Python/3.11.3-GCCcore-12.3.0
SQLite/3.42.0-GCCcore-12.3.0
ScaLAPACK/2.2.0-gompi-2023a-fb
Tcl/8.6.13-GCCcore-12.3.0
UCC/1.2.0-GCCcore-12.3.0
UCX/1.14.1-GCCcore-12.3.0
UnZip/6.0-GCCcore-12.3.0
cURL/8.0.1-GCCcore-12.3.0
foss/2023a
gompi/2023a
hwloc/2.9.1-GCCcore-12.3.0
libarchive/3.6.2-GCCcore-12.3.0
libevent/2.1.12-GCCcore-12.3.0
libfabric/1.18.0-GCCcore-12.3.0
libffi/3.4.4-GCCcore-12.3.0
libpciaccess/0.17-GCCcore-12.3.0
libxml2/2.11.4-GCCcore-12.3.0
make/4.4.1-GCCcore-12.3.0
numactl/2.0.16-GCCcore-12.3.0
pkgconf/1.8.0
pkgconf/1.9.5-GCCcore-12.3.0
xorg-macros/1.20.0-GCCcore-12.3.0
other under _2023.06/software/linux/x8664/amd/zen4
2023.06/init/eessi_environment_variables
May 08 01:15:44 UTC 2024 test result
:cry: FAILURE (click triangle for details)
Reason
EESSI test suite was not run, test step itself failed to execute.
Details
:white_check_mark: job output file slurm-83.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
bedroge commented 4 months ago

The tarball of job 83 contains init files, probably because #569 wasn't merged yet, so let's try again.

boegel commented 4 months ago

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4

eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - no jobs were submitted
boegel commented 4 months ago

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4

eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - no jobs were submitted
eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - submitted job `86`, for details & status see https://github.com/EESSI/software-layer/pull/566#issuecomment-2106193745
eessi-bot[bot] commented 4 months ago
New job on instance eessi-bot-mc-azure for architecture x86_64-amd-zen4 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.05/pr_566/86 date job status comment
May 12 10:08:05 UTC 2024 submitted job id 86 awaits release by job manager
May 12 10:09:08 UTC 2024 released job awaits launch by Slurm scheduler
May 12 10:30:38 UTC 2024 running job 86 is running
May 12 11:08:27 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-86.out
:x: found message matching ERROR:
:white_check_mark: no message matching FAILED:
:white_check_mark: no message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
No artefacts were created or found.
May 12 11:08:27 UTC 2024 test result
:cry: FAILURE (click triangle for details)
Reason
EESSI test suite produced failures.
ReFrame Summary
[ FAILED ] Ran 10/10 test case(s) from 10 check(s) (1 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-86.out
:x: found message matching ERROR:
:x: found message matching [\s*FAILED\s*].*Ran .* test case
boegel commented 4 months ago

Problem fixed by https://github.com/EESSI/software-layer/pull/573, so time to try again...

bot: build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4

eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - no jobs were submitted
eessi-bot[bot] commented 4 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build repo:eessi.io-2023.06-software arch:x86_64/amd/zen4` from `boegel` - expanded format: `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` - handling command `build repository:eessi.io-2023.06-software architecture:x86_64/amd/zen4` resulted in: - submitted job `98`, for details & status see https://github.com/EESSI/software-layer/pull/566#issuecomment-2114884188
eessi-bot[bot] commented 4 months ago
New job on instance eessi-bot-mc-azure for architecture x86_64-amd-zen4 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.05/pr_566/98 date job status comment
May 16 10:51:02 UTC 2024 submitted job id 98 awaits release by job manager
May 16 10:51:21 UTC 2024 released job awaits launch by Slurm scheduler
May 16 13:01:14 UTC 2024 running job 98 is running
May 16 16:47:25 UTC 2024 finished
:grin: SUCCESS (click triangle for details)
Details
:white_check_mark: job output file slurm-98.out
:white_check_mark: no message matching ERROR:
:white_check_mark: no message matching FAILED:
:white_check_mark: no message matching required modules missing:
:white_check_mark: found message(s) matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen4-1715875826.tar.gzsize: 1455 MiB (1526361231 bytes)
entries: 25000
modules under _2023.06/software/linux/x8664/amd/zen4/modules/all
BLIS/0.9.0-GCC-13.2.0.lua
CMake/3.27.6-GCCcore-13.2.0.lua
FFTW.MPI/3.3.10-gompi-2023b.lua
FFTW/3.3.10-GCC-13.2.0.lua
FlexiBLAS/3.3.1-GCC-13.2.0.lua
GCC/13.2.0.lua
GCCcore/13.2.0.lua
OpenBLAS/0.3.24-GCC-13.2.0.lua
OpenMPI/4.1.6-GCC-13.2.0.lua
OpenSSL/1.1.lua
PMIx/4.2.6-GCCcore-13.2.0.lua
Perl/5.38.0-GCCcore-13.2.0.lua
Python/3.11.5-GCCcore-13.2.0.lua
SQLite/3.43.1-GCCcore-13.2.0.lua
ScaLAPACK/2.2.0-gompi-2023b-fb.lua
Tcl/8.6.13-GCCcore-13.2.0.lua
UCC/1.2.0-GCCcore-13.2.0.lua
UCX/1.15.0-GCCcore-13.2.0.lua
UnZip/6.0-GCCcore-13.2.0.lua
cURL/8.3.0-GCCcore-13.2.0.lua
foss/2023b.lua
gompi/2023b.lua
hwloc/2.9.2-GCCcore-13.2.0.lua
libarchive/3.7.2-GCCcore-13.2.0.lua
libevent/2.1.12-GCCcore-13.2.0.lua
libfabric/1.19.0-GCCcore-13.2.0.lua
libffi/3.4.4-GCCcore-13.2.0.lua
libpciaccess/0.17-GCCcore-13.2.0.lua
libxml2/2.11.5-GCCcore-13.2.0.lua
make/4.4.1-GCCcore-13.2.0.lua
numactl/2.0.16-GCCcore-13.2.0.lua
pkgconf/1.8.0.lua
pkgconf/2.0.3-GCCcore-13.2.0.lua
xorg-macros/1.20.0-GCCcore-13.2.0.lua
software under _2023.06/software/linux/x8664/amd/zen4/software
BLIS/0.9.0-GCC-13.2.0
CMake/3.27.6-GCCcore-13.2.0
FFTW.MPI/3.3.10-gompi-2023b
FFTW/3.3.10-GCC-13.2.0
FlexiBLAS/3.3.1-GCC-13.2.0
GCC/13.2.0
GCCcore/13.2.0
OpenBLAS/0.3.24-GCC-13.2.0
OpenMPI/4.1.6-GCC-13.2.0
OpenSSL/1.1
PMIx/4.2.6-GCCcore-13.2.0
Perl/5.38.0-GCCcore-13.2.0
Python/3.11.5-GCCcore-13.2.0
SQLite/3.43.1-GCCcore-13.2.0
ScaLAPACK/2.2.0-gompi-2023b-fb
Tcl/8.6.13-GCCcore-13.2.0
UCC/1.2.0-GCCcore-13.2.0
UCX/1.15.0-GCCcore-13.2.0
UnZip/6.0-GCCcore-13.2.0
cURL/8.3.0-GCCcore-13.2.0
foss/2023b
gompi/2023b
hwloc/2.9.2-GCCcore-13.2.0
libarchive/3.7.2-GCCcore-13.2.0
libevent/2.1.12-GCCcore-13.2.0
libfabric/1.19.0-GCCcore-13.2.0
libffi/3.4.4-GCCcore-13.2.0
libpciaccess/0.17-GCCcore-13.2.0
libxml2/2.11.5-GCCcore-13.2.0
make/4.4.1-GCCcore-13.2.0
numactl/2.0.16-GCCcore-13.2.0
pkgconf/1.8.0
pkgconf/2.0.3-GCCcore-13.2.0
xorg-macros/1.20.0-GCCcore-13.2.0
other under _2023.06/software/linux/x8664/amd/zen4
no other files in tarball
May 16 16:47:25 UTC 2024 test result
:cry: FAILURE (click triangle for details)
Reason
EESSI test suite produced failures.
ReFrame Summary
[ FAILED ] Ran 10/10 test case(s) from 10 check(s) (1 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-98.out
:x: found message matching ERROR:
:x: found message matching [\s*FAILED\s*].*Ran .* test case
May 21 07:05:11 UTC 2024 uploaded transfer of eessi-2023.06-software-linux-x86_64-amd-zen4-1715875826.tar.gz to S3 bucket succeeded
bedroge commented 4 months ago

@boegel I think this needs a sync with the main branch to solve the failing CI? edit: done.