In the spirit of "the default arguments should do the recommended right thing", this should inspect to discover the number of available GPUs and auto-throttle max-gpuprocs to not overwhelm the memory. It's unclear to me if max-gpuprocs is a development leftover or if we have a real use case (including future development) where we would want to set that to be something other than the number of available GPUs. i.e. do we even need that option anymore?
Currently
rrdesi_mpi --gpu ...
with otherwise default arguments runs out of memory:For this to work, you also have to set
--max-gpuprocs 4
to match the number of GPUs on a Perlmutter node:In the spirit of "the default arguments should do the recommended right thing", this should inspect to discover the number of available GPUs and auto-throttle max-gpuprocs to not overwhelm the memory. It's unclear to me if max-gpuprocs is a development leftover or if we have a real use case (including future development) where we would want to set that to be something other than the number of available GPUs. i.e. do we even need that option anymore?
@craigwarner-ufastro @dmargala