EESSI / software-layer

Software layer of the EESSI project
https://eessi.github.io/docs/software_layer
GNU General Public License v2.0
23 stars 46 forks source link

[WIP] DEBUG only {2023.06,2023a} PyTorch-bundle v2.1.2 #603

Open trz42 opened 3 months ago

trz42 commented 3 months ago

The main purpose of this PR is to facilitate debugging various issues when building PyTorch-bundle and demonstrating approaches that could solve the issues. It is expected that the fixes provided here are not final.

Initially we will disable all fixes, build for selected architectures and document the errors. We then enable fixes one-by-one and document the results (some error fixed, some new errors, ...).

Note, see the original PR for PyTorch-bundle (https://github.com/EESSI/software-layer/pull/585) for additional discussion about some of the issues listed above.

eessi-bot[bot] commented 3 months ago

Instance eessi-bot-mc-aws is configured to build:

eessi-bot[bot] commented 3 months ago

Instance eessi-bot-mc-azure is configured to build:

trz42 commented 3 months ago

Initially we'll build only for zen2 and aarch64/generic...

bot: build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software bot: build arch:aarch64/generic repo:eessi.io-2023.06-software

eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - submitted job `12607`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2162771800 - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - submitted job `12608`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2162771906
eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12607

During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/librosa/0.10.1-foss-2023a/lib/python3.11/site-packages/soundfile.py", line 171, in _snd = _ffi.dlopen(_libname) ^^^^^^^^^^^^^^^^^^^^^ OSError: cannot load library 'libsndfile.so.1': libsndfile.so.1: cannot open shared object file: No such file or directory

During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "", line 1, in File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/x86_64/amd/zen2/software/librosa/0.10.1-foss-2023a/lib/python3.11/site-packages/soundfile.py", line 192, in _snd = _ffi.dlopen(_explicit_libname) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ OSError: cannot load library 'libsndfile.so': libsndfile.so: cannot open shared object file: No such file or directory, (at easybuild/framework/easyblock.py:3669 in _sanity_check_step)


- to work around this error we need a custom `ctypes`

|date|job status|comment|
|----------|----------|------------------------|
|Jun 12 11:27:18 UTC 2024|submitted|job id `12607` awaits release by job manager|
|Jun 12 11:28:21 UTC 2024|released|job awaits launch by Slurm scheduler|
|Jun 12 11:35:26 UTC 2024|running|job `12607` is running|
|Jun 12 12:08:26 UTC 2024|finished|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12607.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>FAILED: </code><br/>:x: found message matching <code> required modules missing:</code><br/>:x: no message matching <code>No missing installations</code><br/>:white_check_mark: found message matching <code>\.tar\.gz created!</code><br/></dd><dt>_Artefacts_</dt><dd><details><summary><code>eessi-2023.06-software-linux-x86_64-amd-zen2-1718193717.tar.gz</code></summary>size: 162 MiB (170635688 bytes)<br/>entries: 6322<br/>modules under _2023.06/software/linux/x86_64/amd/zen2/modules/all_<br/><pre><code>imageio/2.33.1-gfbf-2023a.lua</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua</code><br/><code>NLTK/3.8.1-foss-2023a.lua</code><br/><code>numba/0.58.1-foss-2023a.lua</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0.lua</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0.lua</code><br/><code>scikit-image/0.22.0-foss-2023a.lua</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0.lua</code><br/></pre>software under _2023.06/software/linux/x86_64/amd/zen2/software_<br/><pre><code>imageio/2.33.1-gfbf-2023a</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite</code><br/><code>NLTK/3.8.1-foss-2023a</code><br/><code>numba/0.58.1-foss-2023a</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0</code><br/><code>scikit-image/0.22.0-foss-2023a</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0</code><br/></pre>other under _2023.06/software/linux/x86_64/amd/zen2_<br/><pre><code>2023.06/init/easybuild/eb_hooks.py</code><br/></pre></details></dd></dl></details>|
|Jun 12 12:08:26 UTC 2024|test result|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Reason_</dt><dd>EESSI test suite produced failures.</dd><dt>_ReFrame Summary_</dt><dd>[  FAILED  ] Ran 12/12 test case(s) from 12 check(s) (2 failure(s), 0 skipped, 0 aborted)</dd><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12607.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>\[\s\*FAILED\s\*\].\*Ran .\* test case</code><br/></dd></dl></details>|
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12608

During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/librosa/0.10.1-foss-2023a/lib/python3.11/site-packages/soundfile.py", line 171, in _snd = _ffi.dlopen(_libname) ^^^^^^^^^^^^^^^^^^^^^ OSError: cannot load library 'libsndfile.so.1': libsndfile.so.1: cannot open shared object file: No such file or directory

During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "", line 1, in File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/librosa/0.10.1-foss-2023a/lib/python3.11/site-packages/soundfile.py", line 192, in _snd = _ffi.dlopen(_explicit_libname) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ OSError: cannot load library 'libsndfile.so': libsndfile.so: cannot open shared object file: No such file or directory, (at easybuild/framework/easyblock.py:3669 in _sanity_check_step)


- to work around this error we need a custom `ctypes`

|date|job status|comment|
|----------|----------|------------------------|
|Jun 12 11:27:22 UTC 2024|submitted|job id `12608` awaits release by job manager|
|Jun 12 11:28:19 UTC 2024|released|job awaits launch by Slurm scheduler|
|Jun 12 11:34:23 UTC 2024|running|job `12608` is running|
|Jun 12 12:04:20 UTC 2024|finished|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12608.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>FAILED: </code><br/>:x: found message matching <code> required modules missing:</code><br/>:x: no message matching <code>No missing installations</code><br/>:white_check_mark: found message matching <code>\.tar\.gz created!</code><br/></dd><dt>_Artefacts_</dt><dd><details><summary><code>eessi-2023.06-software-linux-aarch64-generic-1718193401.tar.gz</code></summary>size: 152 MiB (160274969 bytes)<br/>entries: 6322<br/>modules under _2023.06/software/linux/aarch64/generic/modules/all_<br/><pre><code>imageio/2.33.1-gfbf-2023a.lua</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua</code><br/><code>NLTK/3.8.1-foss-2023a.lua</code><br/><code>numba/0.58.1-foss-2023a.lua</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0.lua</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0.lua</code><br/><code>scikit-image/0.22.0-foss-2023a.lua</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0.lua</code><br/></pre>software under _2023.06/software/linux/aarch64/generic/software_<br/><pre><code>imageio/2.33.1-gfbf-2023a</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite</code><br/><code>NLTK/3.8.1-foss-2023a</code><br/><code>numba/0.58.1-foss-2023a</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0</code><br/><code>scikit-image/0.22.0-foss-2023a</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0</code><br/></pre>other under _2023.06/software/linux/aarch64/generic_<br/><pre><code>2023.06/init/easybuild/eb_hooks.py</code><br/></pre></details></dd></dl></details>|
|Jun 12 12:04:20 UTC 2024|test result|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Reason_</dt><dd>EESSI test suite produced failures.</dd><dt>_ReFrame Summary_</dt><dd>[  FAILED  ] Ran 12/12 test case(s) from 12 check(s) (2 failure(s), 0 skipped, 0 aborted)</dd><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12608.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>\[\s\*FAILED\s\*\].\*Ran .\* test case</code><br/></dd></dl></details>|
trz42 commented 3 months ago

The two jobs (12607 and 12608) that did not include any fixes failed both in the sanity check for librosa. After enabling the fixes for that by

we repeat the building for the same architectures zen2 and aarch64/generic...

bot: build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software bot: build arch:aarch64/generic repo:eessi.io-2023.06-software

eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - submitted job `12808`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2169398033 - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - submitted job `12809`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2169398074
eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12808

date job status comment
Jun 15 12:04:28 UTC 2024 submitted job id 12808 awaits release by job manager
Jun 15 12:04:32 UTC 2024 released job awaits launch by Slurm scheduler
Jun 15 12:10:36 UTC 2024 running job 12808 is running
Jun 15 13:47:58 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-12808.out
:x: found message matching ERROR:
:x: found message matching FAILED:
:x: found message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1718457726.tar.gzsize: 282 MiB (296485955 bytes)
entries: 9314
modules under _2023.06/software/linux/x8664/amd/zen2/modules/all
custom_ctypes/1.2.lua
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
libmad/0.15.1b-GCCcore-12.3.0.lua
librosa/0.10.1-foss-2023a.lua
LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua
NLTK/3.8.1-foss-2023a.lua
numba/0.58.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
SentencePiece/0.2.0-GCC-12.3.0.lua
SoX/14.4.2-GCCcore-12.3.0.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under _2023.06/software/linux/x8664/amd/zen2/software
custom_ctypes/1.2
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
libmad/0.15.1b-GCCcore-12.3.0
librosa/0.10.1-foss-2023a
LLVM/14.0.6-GCCcore-12.3.0-llvmlite
NLTK/3.8.1-foss-2023a
numba/0.58.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
SentencePiece/0.2.0-GCC-12.3.0
SoX/14.4.2-GCCcore-12.3.0
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under _2023.06/software/linux/x8664/amd/zen2
2023.06/init/easybuild/eb_hooks.py
Jun 15 13:47:58 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-12808.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12809

date job status comment
Jun 15 12:04:32 UTC 2024 submitted job id 12809 awaits release by job manager
Jun 15 12:05:34 UTC 2024 released job awaits launch by Slurm scheduler
Jun 15 12:11:38 UTC 2024 running job 12809 is running
Jun 15 13:04:14 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-12809.out
:x: found message matching ERROR:
:x: found message matching FAILED:
:x: found message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1718455310.tar.gzsize: 258 MiB (270844195 bytes)
entries: 9169
modules under 2023.06/software/linux/aarch64/generic/modules/all
custom_ctypes/1.2.lua
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
librosa/0.10.1-foss-2023a.lua
LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua
NLTK/3.8.1-foss-2023a.lua
numba/0.58.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under 2023.06/software/linux/aarch64/generic/software
custom_ctypes/1.2
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
librosa/0.10.1-foss-2023a
LLVM/14.0.6-GCCcore-12.3.0-llvmlite
NLTK/3.8.1-foss-2023a
numba/0.58.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under 2023.06/software/linux/aarch64/generic
2023.06/init/easybuild/eb_hooks.py
Jun 15 13:04:14 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-12809.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
trz42 commented 3 months ago

The two jobs (12608 // zen2 and 12609 // aarch64/generic) didn't fail for the earlier reason (import of soundfile failed). They failed for different reasons however (for details see above). We first fix the issue for aarch64/generic (because the build for that architecture failed earlier than the build for zen2). The fix disables the use of the TC_MALLOC library. Because the fix is made for aarch64/generic only, we also check if builds for the other aarch64 are affected by the issue.

bot: build arch:aarch64/generic repo:eessi.io-2023.06-software bot: build arch:aarch64/neoverse_n1 repo:eessi.io-2023.06-software bot: build arch:aarch64/neoverse_v1 repo:eessi.io-2023.06-software

eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/neoverse_n1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/neoverse_v1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - submitted job `12813`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2170460546 - handling command `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` resulted in: - submitted job `12814`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2170460599 - handling command `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` resulted in: - submitted job `12815`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2170460654
eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/neoverse_n1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/neoverse_v1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted - handling command `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted - handling command `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12813

Current thread 0x000040002a9e5a00 (most recent call first): File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1268 in TestMaskTransform File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1255 in File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/assertion/rewrite.py", line 178 in exec_module File "", line 690 in _load_unlocked File "", line 1149 in _find_and_load_unlocked File "", line 1178 in _find_and_load File "", line 1206 in _gcd_import File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python/3.11.3-GCCcore-12.3.0/lib/python3.11/importlib/init.py", line 126 in import_module File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/pathlib.py", line 565 in im port_path File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/python.py", line 617 in _im porttestmodule File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/python.py", line 528 in _ge tobj File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/python.py", line 310 in obj File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/python.py", line 545 in _in ject_setup_module_fixture File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/python.py", line 531 in col lect File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/runner.py", line 372 in <la mbda> File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/runner.py", line 341 in fro m_call File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/runner.py", line 372 in pyt est_make_collect_report File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_callers.py", line 80 in _multicall File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_manager.py", line 112 in _hookexec File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_hooks.py", line 433 in call File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/runner.py", line 547 in col lect_one_node File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/main.py", line 836 in genit ems File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/main.py", line 839 in genit ems File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/main.py", line 669 in perfo rm_collect File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/main.py", line 334 in pytes t_collection File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_callers.py", line 80 in _multicall File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_manager.py", line 112 in _hookexec File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_hooks.py", line 433 in call File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/main.py", line 323 in _main File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/pytest/main.py", line 270 in wrap session File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/main.py", line 317 in pytes t_cmdline_main File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_callers.py", line 80 in _multicall File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_manager.py", line 112 in _hookexec File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/hatchling/1.18.0-GCCcore-12.3.0/lib/python3.11/site-packages/pluggy/_hooks.py", line 433 in call File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/config/init.py", line 1 66 in main File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/config/init.py", line 1 89 in console_main File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/bin/pytest", line 8 in

Extension modules: numpy.core._multiarray_umath, numpy.core._multiarray_tests, numpy.linalg._umath_linalg, numpy.fft._pocketfft_internal, numpy.random._common, numpy.random.bit_generator, numpy.ra ndom._bounded_integers, numpy.random._mt19937, numpy.random.mtrand, numpy.random._philox, numpy.random._pcg64, numpy.random._sfc64, numpy.random._generator, torch._C, torch._C._fft, torch._C._lina lg, torch._C._nested, torch._C._nn, torch._C._sparse, torch._C._special, gmpy2.gmpy2, simplejson._speedups (total: 22)



- it may be that we have seen that earlier when building for NESSI ... we didn't have a fix for that there, so this requires more investigation

|date|job status|comment|
|----------|----------|------------------------|
|Jun 15 18:07:39 UTC 2024|submitted|job id `12813` awaits release by job manager|
|Jun 15 18:08:23 UTC 2024|released|job awaits launch by Slurm scheduler|
|Jun 15 18:13:30 UTC 2024|running|job `12813` is running|
|Jun 15 19:09:48 UTC 2024|finished|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12813.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>FAILED: </code><br/>:x: found message matching <code> required modules missing:</code><br/>:x: no message matching <code>No missing installations</code><br/>:white_check_mark: found message matching <code>\.tar\.gz created!</code><br/></dd><dt>_Artefacts_</dt><dd><details><summary><code>eessi-2023.06-software-linux-aarch64-generic-1718477177.tar.gz</code></summary>size: 271 MiB (284370882 bytes)<br/>entries: 9314<br/>modules under _2023.06/software/linux/aarch64/generic/modules/all_<br/><pre><code>custom_ctypes/1.2.lua</code><br/><code>gperftools/2.12-GCCcore-12.3.0.lua</code><br/><code>imageio/2.33.1-gfbf-2023a.lua</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0.lua</code><br/><code>librosa/0.10.1-foss-2023a.lua</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua</code><br/><code>NLTK/3.8.1-foss-2023a.lua</code><br/><code>numba/0.58.1-foss-2023a.lua</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0.lua</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0.lua</code><br/><code>scikit-image/0.22.0-foss-2023a.lua</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0.lua</code><br/><code>SoX/14.4.2-GCCcore-12.3.0.lua</code><br/><code>tensorboard/2.15.1-gfbf-2023a.lua</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0.lua</code><br/></pre>software under _2023.06/software/linux/aarch64/generic/software_<br/><pre><code>custom_ctypes/1.2</code><br/><code>gperftools/2.12-GCCcore-12.3.0</code><br/><code>imageio/2.33.1-gfbf-2023a</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0</code><br/><code>librosa/0.10.1-foss-2023a</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite</code><br/><code>NLTK/3.8.1-foss-2023a</code><br/><code>numba/0.58.1-foss-2023a</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0</code><br/><code>scikit-image/0.22.0-foss-2023a</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0</code><br/><code>SoX/14.4.2-GCCcore-12.3.0</code><br/><code>tensorboard/2.15.1-gfbf-2023a</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0</code><br/></pre>other under _2023.06/software/linux/aarch64/generic_<br/><pre><code>2023.06/init/easybuild/eb_hooks.py</code><br/></pre></details></dd></dl></details>|
|Jun 15 19:09:48 UTC 2024|test result|<details><summary>:grin: SUCCESS _(click triangle for details)_</summary><dl><dt>_ReFrame Summary_</dt><dd>[  PASSED  ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)</dd><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12813.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:white_check_mark: no message matching <code>\[\s\*FAILED\s\*\].\*Ran .\* test case</code><br/></dd></dl></details>|
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_n1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12814

date job status comment
Jun 15 18:07:43 UTC 2024 submitted job id 12814 awaits release by job manager
Jun 15 18:08:25 UTC 2024 released job awaits launch by Slurm scheduler
Jun 15 18:14:32 UTC 2024 running job 12814 is running
Jun 15 19:06:45 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-12814.out
:x: found message matching ERROR:
:x: found message matching FAILED:
:x: found message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_n1-1718477049.tar.gzsize: 258 MiB (271203990 bytes)
entries: 9169
modules under _2023.06/software/linux/aarch64/neoversen1/modules/all
custom_ctypes/1.2.lua
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
librosa/0.10.1-foss-2023a.lua
LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua
NLTK/3.8.1-foss-2023a.lua
numba/0.58.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under _2023.06/software/linux/aarch64/neoversen1/software
custom_ctypes/1.2
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
librosa/0.10.1-foss-2023a
LLVM/14.0.6-GCCcore-12.3.0-llvmlite
NLTK/3.8.1-foss-2023a
numba/0.58.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under _2023.06/software/linux/aarch64/neoversen1
2023.06/init/easybuild/eb_hooks.py
Jun 15 19:06:45 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-12814.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12815

date job status comment
Jun 15 18:07:47 UTC 2024 submitted job id 12815 awaits release by job manager
Jun 15 18:08:27 UTC 2024 released job awaits launch by Slurm scheduler
Jun 15 18:14:34 UTC 2024 running job 12815 is running
Jun 15 18:52:16 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-12815.out
:x: found message matching ERROR:
:x: found message matching FAILED:
:x: found message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-neoverse_v1-1718476614.tar.gzsize: 258 MiB (270948899 bytes)
entries: 9169
modules under _2023.06/software/linux/aarch64/neoversev1/modules/all
custom_ctypes/1.2.lua
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
librosa/0.10.1-foss-2023a.lua
LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua
NLTK/3.8.1-foss-2023a.lua
numba/0.58.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under _2023.06/software/linux/aarch64/neoversev1/software
custom_ctypes/1.2
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
librosa/0.10.1-foss-2023a
LLVM/14.0.6-GCCcore-12.3.0-llvmlite
NLTK/3.8.1-foss-2023a
numba/0.58.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under _2023.06/software/linux/aarch64/neoversev1
2023.06/init/easybuild/eb_hooks.py
Jun 15 18:52:16 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-12815.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
trz42 commented 3 months ago

Rebuilding for aarch64/neoverse_n1 and aarch64/neoverse_v1 after fix for SentencePiece has been extended to these architectures...

bot: build arch:aarch64/neoverse_n1 repo:eessi.io-2023.06-software bot: build arch:aarch64/neoverse_v1 repo:eessi.io-2023.06-software

eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:aarch64/neoverse_n1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/neoverse_v1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` - handling command `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` resulted in: - submitted job `12816`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2170583100 - handling command `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` resulted in: - submitted job `12817`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2170583352
eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:aarch64/neoverse_n1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/neoverse_v1 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` - handling command `build architecture:aarch64/neoverse_n1 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted - handling command `build architecture:aarch64/neoverse_v1 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_n1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12816

Current thread 0x000040003d3e5a80 (most recent call first): File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1268 in TestMaskTransform File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1255 in File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_n1/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/assertion/rewrite.py", line 178 in exec_module File "", line 690 in _load_unlocked File "", line 1149 in _find_and_load_unlocked File "", line 1178 in _find_and_load File "", line 1206 in _gcd_import ...



|date|job status|comment|
|----------|----------|------------------------|
|Jun 15 19:34:52 UTC 2024|submitted|job id `12816` awaits release by job manager|
|Jun 15 19:35:52 UTC 2024|released|job awaits launch by Slurm scheduler|
|Jun 15 19:36:56 UTC 2024|running|job `12816` is running|
|Jun 15 20:35:33 UTC 2024|finished|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12816.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>FAILED: </code><br/>:x: found message matching <code> required modules missing:</code><br/>:x: no message matching <code>No missing installations</code><br/>:white_check_mark: found message matching <code>\.tar\.gz created!</code><br/></dd><dt>_Artefacts_</dt><dd><details><summary><code>eessi-2023.06-software-linux-aarch64-neoverse_n1-1718482255.tar.gz</code></summary>size: 271 MiB (284726536 bytes)<br/>entries: 9314<br/>modules under _2023.06/software/linux/aarch64/neoverse_n1/modules/all_<br/><pre><code>custom_ctypes/1.2.lua</code><br/><code>gperftools/2.12-GCCcore-12.3.0.lua</code><br/><code>imageio/2.33.1-gfbf-2023a.lua</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0.lua</code><br/><code>librosa/0.10.1-foss-2023a.lua</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua</code><br/><code>NLTK/3.8.1-foss-2023a.lua</code><br/><code>numba/0.58.1-foss-2023a.lua</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0.lua</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0.lua</code><br/><code>scikit-image/0.22.0-foss-2023a.lua</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0.lua</code><br/><code>SoX/14.4.2-GCCcore-12.3.0.lua</code><br/><code>tensorboard/2.15.1-gfbf-2023a.lua</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0.lua</code><br/></pre>software under _2023.06/software/linux/aarch64/neoverse_n1/software_<br/><pre><code>custom_ctypes/1.2</code><br/><code>gperftools/2.12-GCCcore-12.3.0</code><br/><code>imageio/2.33.1-gfbf-2023a</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0</code><br/><code>librosa/0.10.1-foss-2023a</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite</code><br/><code>NLTK/3.8.1-foss-2023a</code><br/><code>numba/0.58.1-foss-2023a</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0</code><br/><code>scikit-image/0.22.0-foss-2023a</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0</code><br/><code>SoX/14.4.2-GCCcore-12.3.0</code><br/><code>tensorboard/2.15.1-gfbf-2023a</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0</code><br/></pre>other under _2023.06/software/linux/aarch64/neoverse_n1_<br/><pre><code>2023.06/init/easybuild/eb_hooks.py</code><br/></pre></details></dd></dl></details>|
|Jun 15 20:35:33 UTC 2024|test result|<details><summary>:grin: SUCCESS _(click triangle for details)_</summary><dl><dt>_ReFrame Summary_</dt><dd>[  PASSED  ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)</dd><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12816.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:white_check_mark: no message matching <code>\[\s\*FAILED\s\*\].\*Ran .\* test case</code><br/></dd></dl></details>|
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-neoverse_v1 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/12817

Current thread 0x000040003cc75a80 (most recent call first): File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1268 in TestMaskTransform File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1255 in File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/neoverse_v1/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/assertion/rewrite.py", line 178 in exec_module File "", line 690 in _load_unlocked File "", line 1149 in _find_and_load_unlocked File "", line 1178 in _find_and_load File "", line 1206 in _gcd_import



|date|job status|comment|
|----------|----------|------------------------|
|Jun 15 19:34:56 UTC 2024|submitted|job id `12817` awaits release by job manager|
|Jun 15 19:35:54 UTC 2024|released|job awaits launch by Slurm scheduler|
|Jun 15 19:36:58 UTC 2024|running|job `12817` is running|
|Jun 15 20:18:15 UTC 2024|finished|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12817.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>FAILED: </code><br/>:x: found message matching <code> required modules missing:</code><br/>:x: no message matching <code>No missing installations</code><br/>:white_check_mark: found message matching <code>\.tar\.gz created!</code><br/></dd><dt>_Artefacts_</dt><dd><details><summary><code>eessi-2023.06-software-linux-aarch64-neoverse_v1-1718481760.tar.gz</code></summary>size: 271 MiB (284470404 bytes)<br/>entries: 9314<br/>modules under _2023.06/software/linux/aarch64/neoverse_v1/modules/all_<br/><pre><code>custom_ctypes/1.2.lua</code><br/><code>gperftools/2.12-GCCcore-12.3.0.lua</code><br/><code>imageio/2.33.1-gfbf-2023a.lua</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0.lua</code><br/><code>librosa/0.10.1-foss-2023a.lua</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua</code><br/><code>NLTK/3.8.1-foss-2023a.lua</code><br/><code>numba/0.58.1-foss-2023a.lua</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0.lua</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0.lua</code><br/><code>scikit-image/0.22.0-foss-2023a.lua</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0.lua</code><br/><code>SoX/14.4.2-GCCcore-12.3.0.lua</code><br/><code>tensorboard/2.15.1-gfbf-2023a.lua</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0.lua</code><br/></pre>software under _2023.06/software/linux/aarch64/neoverse_v1/software_<br/><pre><code>custom_ctypes/1.2</code><br/><code>gperftools/2.12-GCCcore-12.3.0</code><br/><code>imageio/2.33.1-gfbf-2023a</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0</code><br/><code>librosa/0.10.1-foss-2023a</code><br/><code>LLVM/14.0.6-GCCcore-12.3.0-llvmlite</code><br/><code>NLTK/3.8.1-foss-2023a</code><br/><code>numba/0.58.1-foss-2023a</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0</code><br/><code>scikit-image/0.22.0-foss-2023a</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0</code><br/><code>SoX/14.4.2-GCCcore-12.3.0</code><br/><code>tensorboard/2.15.1-gfbf-2023a</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0</code><br/></pre>other under _2023.06/software/linux/aarch64/neoverse_v1_<br/><pre><code>2023.06/init/easybuild/eb_hooks.py</code><br/></pre></details></dd></dl></details>|
|Jun 15 20:18:15 UTC 2024|test result|<details><summary>:grin: SUCCESS _(click triangle for details)_</summary><dl><dt>_ReFrame Summary_</dt><dd>[  PASSED  ] Ran 12/12 test case(s) from 12 check(s) (0 failure(s), 0 skipped, 0 aborted)</dd><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-12817.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:white_check_mark: no message matching <code>\[\s\*FAILED\s\*\].\*Ran .\* test case</code><br/></dd></dl></details>|
trz42 commented 3 months ago

Rebuilding for zen2 to verify if a new easyblock for torchvision fixes the issue that libjpeg couldn't be find...

bot: build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software

eessi-build-deploy-bot-deucalion[bot] commented 3 months ago
Updates by the bot instance boegel-bot-deucalion (click for details) - account `trz42` has NO permission to send commands to the bot
eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - submitted job `13549`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2198332951
eessi-bot[bot] commented 3 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 3 months ago

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.06/pr_603/13549

date job status comment
Jun 29 20:55:20 UTC 2024 submitted job id 13549 awaits release by job manager
Jun 29 20:55:26 UTC 2024 released job awaits launch by Slurm scheduler
Jun 29 21:00:28 UTC 2024 running job 13549 is running
Jun 29 23:04:35 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-13549.out
:x: found message matching ERROR:
:white_check_mark: no message matching FAILED:
:x: found message matching required modules missing:
:white_check_mark: found message(s) matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1719701425.tar.gzsize: 293 MiB (307397497 bytes)
entries: 10800
modules under _2023.06/software/linux/x8664/amd/zen2/modules/all
custom_ctypes/1.2.lua
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
libmad/0.15.1b-GCCcore-12.3.0.lua
librosa/0.10.1-foss-2023a.lua
LLVM/14.0.6-GCCcore-12.3.0-llvmlite.lua
NLTK/3.8.1-foss-2023a.lua
numba/0.58.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
PyTorch-bundle/2.1.2-foss-2023a.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
SentencePiece/0.2.0-GCC-12.3.0.lua
SoX/14.4.2-GCCcore-12.3.0.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under _2023.06/software/linux/x8664/amd/zen2/software
custom_ctypes/1.2
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
libmad/0.15.1b-GCCcore-12.3.0
librosa/0.10.1-foss-2023a
LLVM/14.0.6-GCCcore-12.3.0-llvmlite
NLTK/3.8.1-foss-2023a
numba/0.58.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
PyTorch-bundle/2.1.2-foss-2023a
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
SentencePiece/0.2.0-GCC-12.3.0
SoX/14.4.2-GCCcore-12.3.0
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under _2023.06/software/linux/x8664/amd/zen2
2023.06/init/easybuild/eb_hooks.py
Jun 29 23:04:35 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 14/14 test case(s) from 14 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-13549.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
boegel commented 2 months ago

Rebuilding for zen2 to verify if a new easyblock for torchvision fixes the issue that libjpeg couldn't be find...

Maybe related to:

trz42 commented 2 months ago

Rebuilding after #655 got merged to verify if the import soundfile in librosa's sanity check succeeds...

bot: build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software bot: build arch:aarch64/generic repo:eessi.io-2023.06-software

eessi-bot[bot] commented 2 months ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - submitted job `15500`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2262217653 - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - submitted job `15501`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2262217740
eessi-bot[bot] commented 2 months ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 2 months ago

New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.08/pr_603/15500

date job status comment
Aug 01 07:12:23 UTC 2024 submitted job id 15500 awaits release by job manager
Aug 01 07:12:54 UTC 2024 released job awaits launch by Slurm scheduler
Aug 01 07:18:58 UTC 2024 running job 15500 is running
Aug 01 09:08:30 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-15500.out
:x: found message matching ERROR:
:white_check_mark: no message matching FAILED:
:x: found message matching required modules missing:
:white_check_mark: found message(s) matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1722502307.tar.gzsize: 154 MiB (162347107 bytes)
entries: 6301
modules under _2023.06/software/linux/x8664/amd/zen2/modules/all
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
libmad/0.15.1b-GCCcore-12.3.0.lua
NLTK/3.8.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
PyTorch-bundle/2.1.2-foss-2023a.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
SentencePiece/0.2.0-GCC-12.3.0.lua
SoX/14.4.2-GCCcore-12.3.0.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under _2023.06/software/linux/x8664/amd/zen2/software
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
libmad/0.15.1b-GCCcore-12.3.0
NLTK/3.8.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
PyTorch-bundle/2.1.2-foss-2023a
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
SentencePiece/0.2.0-GCC-12.3.0
SoX/14.4.2-GCCcore-12.3.0
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under _2023.06/software/linux/x8664/amd/zen2
2023.06/init/easybuild/eb_hooks.py
Aug 01 09:08:30 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 16/16 test case(s) from 16 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-15500.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
eessi-bot[bot] commented 2 months ago

New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.08/pr_603/15501

Current thread 0x000040003ebf5a00 (most recent call first): File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1268 in TestMaskTransform File "/tmp/bot/easybuild/build/PyTorchbundle/2.1.2/foss-2023a/torchtext/text-0.16.2/test/torchtext_unittest/test_transforms.py", line 1255 in File "/cvmfs/software.eessi.io/versions/2023.06/software/linux/aarch64/generic/software/Python-bundle-PyPI/2023.06-GCCcore-12.3.0/lib/python3.11/site-packages/_pytest/assertion/rewrite.py", line 178 in exec_module File "", line 690 in _load_unlocked File "", line 1149 in _find_and_load_unlocked File "", line 1178 in _find_and_load File "", line 1206 in _gcd_import


- librosa has already been ingested (hence sanity check wasn't run at all)

|date|job status|comment|
|----------|----------|------------------------|
|Aug 01 07:12:27 UTC 2024|submitted|job id `15501` awaits release by job manager|
|Aug 01 07:12:52 UTC 2024|released|job awaits launch by Slurm scheduler|
|Aug 01 07:18:56 UTC 2024|running|job `15501` is running|
|Aug 01 08:14:17 UTC 2024|finished|<details><summary>:cry: FAILURE _(click triangle for details)_</summary><dl><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-15501.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:x: found message matching <code>FAILED: </code><br/>:x: found message matching <code> required modules missing:</code><br/>:white_check_mark: found message(s) matching <code>No missing installations</code><br/>:white_check_mark: found message matching <code>\.tar\.gz created!</code><br/></dd><dt>_Artefacts_</dt><dd><details><summary><code>eessi-2023.06-software-linux-aarch64-generic-1722497905.tar.gz</code></summary>size: 142 MiB (149117531 bytes)<br/>entries: 4815<br/>modules under _2023.06/software/linux/aarch64/generic/modules/all_<br/><pre><code>gperftools/2.12-GCCcore-12.3.0.lua</code><br/><code>imageio/2.33.1-gfbf-2023a.lua</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0.lua</code><br/><code>NLTK/3.8.1-foss-2023a.lua</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0.lua</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0.lua</code><br/><code>scikit-image/0.22.0-foss-2023a.lua</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0.lua</code><br/><code>SoX/14.4.2-GCCcore-12.3.0.lua</code><br/><code>tensorboard/2.15.1-gfbf-2023a.lua</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0.lua</code><br/></pre>software under _2023.06/software/linux/aarch64/generic/software_<br/><pre><code>gperftools/2.12-GCCcore-12.3.0</code><br/><code>imageio/2.33.1-gfbf-2023a</code><br/><code>libmad/0.15.1b-GCCcore-12.3.0</code><br/><code>NLTK/3.8.1-foss-2023a</code><br/><code>parameterized/0.9.0-GCCcore-12.3.0</code><br/><code>Scalene/1.5.26-GCCcore-12.3.0</code><br/><code>scikit-image/0.22.0-foss-2023a</code><br/><code>SentencePiece/0.2.0-GCC-12.3.0</code><br/><code>SoX/14.4.2-GCCcore-12.3.0</code><br/><code>tensorboard/2.15.1-gfbf-2023a</code><br/><code>tqdm/4.66.1-GCCcore-12.3.0</code><br/></pre>other under _2023.06/software/linux/aarch64/generic_<br/><pre><code>2023.06/init/easybuild/eb_hooks.py</code><br/></pre></details></dd></dl></details>|
|Aug 01 08:14:17 UTC 2024|test result|<details><summary>:grin: SUCCESS _(click triangle for details)_</summary><dl><dt>_ReFrame Summary_</dt><dd>[  PASSED  ] Ran 16/16 test case(s) from 16 check(s) (0 failure(s), 0 skipped, 0 aborted)</dd><dt>_Details_</dt><dd>:white_check_mark: job output file <code>slurm-15501.out</code><br/>:x: found message matching <code>ERROR: </code><br/>:white_check_mark: no message matching <code>\[\s\*FAILED\s\*\].\*Ran .\* test case</code><br/></dd></dl></details>|
trz42 commented 1 month ago

Rebuilding after changes have been minimised (only hook for SentencePiece kept for now) and #660 has been ingested...

bot: build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software

eessi-bot[bot] commented 1 month ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - submitted job `15895`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2275485940
eessi-bot[bot] commented 1 month ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:x86_64/amd/zen2 repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` - handling command `build architecture:x86_64/amd/zen2 repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 1 month ago
New job on instance eessi-bot-mc-aws for architecture x86_64-amd-zen2 for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.08/pr_603/15895 date job status comment
Aug 08 10:29:43 UTC 2024 submitted job id 15895 awaits release by job manager
Aug 08 10:30:06 UTC 2024 released job awaits launch by Slurm scheduler
Aug 08 10:36:09 UTC 2024 running job 15895 is running
Aug 08 12:26:42 UTC 2024 finished
:grin: SUCCESS (click triangle for details)
Details
:white_check_mark: job output file slurm-15895.out
:white_check_mark: no message matching ERROR:
:white_check_mark: no message matching FAILED:
:white_check_mark: no message matching required modules missing:
:white_check_mark: found message(s) matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-x86_64-amd-zen2-1723118912.tar.gzsize: 154 MiB (162356101 bytes)
entries: 6302
modules under _2023.06/software/linux/x8664/amd/zen2/modules/all
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
libmad/0.15.1b-GCCcore-12.3.0.lua
NLTK/3.8.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
PyTorch-bundle/2.1.2-foss-2023a.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
SentencePiece/0.2.0-GCC-12.3.0.lua
SoX/14.4.2-GCCcore-12.3.0.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under _2023.06/software/linux/x8664/amd/zen2/software
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
libmad/0.15.1b-GCCcore-12.3.0
NLTK/3.8.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
PyTorch-bundle/2.1.2-foss-2023a
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
SentencePiece/0.2.0-GCC-12.3.0
SoX/14.4.2-GCCcore-12.3.0
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under _2023.06/software/linux/x8664/amd/zen2
2023.06/init/easybuild/eb_hooks.py
2023.06/init/eessi_archdetect.sh
Aug 08 12:26:42 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 17/17 test case(s) from 17 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-15895.out
:white_check_mark: no message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
trz42 commented 3 weeks ago

Revisit switching off TCMALLOC...

bot: build arch:aarch64/generic repo:eessi.io-2023.06-software

eessi-bot[bot] commented 3 weeks ago
Updates by the bot instance eessi-bot-mc-aws (click for details) - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - submitted job `17634`, for details & status see https://github.com/EESSI/software-layer/pull/603#issuecomment-2326409002
eessi-bot[bot] commented 3 weeks ago
Updates by the bot instance eessi-bot-mc-azure (click for details) - received bot command `build arch:aarch64/generic repo:eessi.io-2023.06-software` from `trz42` - expanded format: `build architecture:aarch64/generic repository:eessi.io-2023.06-software` - handling command `build architecture:aarch64/generic repository:eessi.io-2023.06-software` resulted in: - no jobs were submitted
eessi-bot[bot] commented 3 weeks ago
New job on instance eessi-bot-mc-aws for architecture aarch64-generic for repository eessi.io-2023.06-software in job dir /project/def-users/SHARED/jobs/2024.09/pr_603/17634 date job status comment
Sep 03 12:33:52 UTC 2024 submitted job id 17634 awaits release by job manager
Sep 03 12:34:22 UTC 2024 released job awaits launch by Slurm scheduler
Sep 03 12:40:25 UTC 2024 running job 17634 is running
Sep 03 13:53:08 UTC 2024 finished
:cry: FAILURE (click triangle for details)
Details
:white_check_mark: job output file slurm-17634.out
:x: found message matching ERROR:
:x: found message matching FAILED:
:x: found message matching required modules missing:
:x: no message matching No missing installations
:white_check_mark: found message matching .tar.gz created!
Artefacts
eessi-2023.06-software-linux-aarch64-generic-1725368522.tar.gzsize: 142 MiB (149130803 bytes)
entries: 4815
modules under 2023.06/software/linux/aarch64/generic/modules/all
gperftools/2.12-GCCcore-12.3.0.lua
imageio/2.33.1-gfbf-2023a.lua
libmad/0.15.1b-GCCcore-12.3.0.lua
NLTK/3.8.1-foss-2023a.lua
parameterized/0.9.0-GCCcore-12.3.0.lua
Scalene/1.5.26-GCCcore-12.3.0.lua
scikit-image/0.22.0-foss-2023a.lua
SentencePiece/0.2.0-GCC-12.3.0.lua
SoX/14.4.2-GCCcore-12.3.0.lua
tensorboard/2.15.1-gfbf-2023a.lua
tqdm/4.66.1-GCCcore-12.3.0.lua
software under 2023.06/software/linux/aarch64/generic/software
gperftools/2.12-GCCcore-12.3.0
imageio/2.33.1-gfbf-2023a
libmad/0.15.1b-GCCcore-12.3.0
NLTK/3.8.1-foss-2023a
parameterized/0.9.0-GCCcore-12.3.0
Scalene/1.5.26-GCCcore-12.3.0
scikit-image/0.22.0-foss-2023a
SentencePiece/0.2.0-GCC-12.3.0
SoX/14.4.2-GCCcore-12.3.0
tensorboard/2.15.1-gfbf-2023a
tqdm/4.66.1-GCCcore-12.3.0
other under 2023.06/software/linux/aarch64/generic
2023.06/init/easybuild/eb_hooks.py
Sep 03 13:53:08 UTC 2024 test result
:grin: SUCCESS (click triangle for details)
ReFrame Summary
[ PASSED ] Ran 18/18 test case(s) from 18 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
:white_check_mark: job output file slurm-17634.out
:x: found message matching ERROR:
:white_check_mark: no message matching [\s*FAILED\s*].*Ran .* test case
trz42 commented 3 weeks ago

Maybe switch off the following

https://github.com/google/sentencepiece/blob/d8f741853847553169444afc12c00f4bbff3e9ce/src/CMakeLists.txt#L271-L274