LLNL / ATS

ATS - Automated Testing System - is an open-source, Python-based tool for automating the running of tests of an application across a broad range of high performance computers.
BSD 3-Clause "New" or "Revised" License
6 stars 5 forks source link

Ensure ATS can run CPU and GPU jobs concurrently on the same node with Flux #148

Open dawson6 opened 11 months ago

dawson6 commented 11 months ago

Verify (and fix if not verified) that we can

1) Run CPU only tests and CPU+GPU tests on the same node concurrently using flux.

This may mean creating (or tweaking) an existing test setup such that two codes are specified in the ATS test files.

That is

Code A) built for the CPU only. Does not need access to the GPU at all (in particularly for memory access) Code B) Built for the CPU+GPU. That is it will require access to either hipMalloc or hipMallocManaged memory at run time.

Verify we can saturate the nodes (for throughput) with a combination of the above codes.