ATS - Automated Testing System - is an open-source, Python-based tool for automating the running of tests of an application across a broad range of high performance computers.
BSD 3-Clause "New" or "Revised" License
6
stars
5
forks
source link
Ensure ATS can run CPU and GPU jobs concurrently on the same node with Flux #148
1) Run CPU only tests and CPU+GPU tests on the same node concurrently using flux.
This may mean creating (or tweaking) an existing test setup such that two codes are specified in the ATS test files.
That is
Code A) built for the CPU only. Does not need access to the GPU at all (in particularly for memory access)
Code B) Built for the CPU+GPU. That is it will require access to either hipMalloc or hipMallocManaged memory at run time.
Verify we can saturate the nodes (for throughput) with a combination of the above codes.
Verify (and fix if not verified) that we can
1) Run CPU only tests and CPU+GPU tests on the same node concurrently using flux.
This may mean creating (or tweaking) an existing test setup such that two codes are specified in the ATS test files.
That is
Code A) built for the CPU only. Does not need access to the GPU at all (in particularly for memory access) Code B) Built for the CPU+GPU. That is it will require access to either hipMalloc or hipMallocManaged memory at run time.
Verify we can saturate the nodes (for throughput) with a combination of the above codes.