This PR resolves #201 by updating the report function and addresses an out-of-memory error by moving the device buffers from the ConfigurationBase class to the ClArgs struct.
β¨ Change Description/Rationale
Updated report function to remove unnecessary multiplication and subsequent division by nruns
Now it more closely aligns with the calculation of bytes_moved in Spatter v1.1
Moved dev_sparse, dev_sparse_gather, dev_sparse_scatter, and dev_dense buffers to ClArgs
Reduces the memory usage of the CUDA backend when multiple configs are provided
Overview
This PR resolves #201 by updating the
report
function and addresses an out-of-memory error by moving the device buffers from theConfigurationBase
class to theClArgs
struct.β¨ Change Description/Rationale
report
function to remove unnecessary multiplication and subsequent division bynruns
bytes_moved
in Spatter v1.1dev_sparse
,dev_sparse_gather
,dev_sparse_scatter
, anddev_dense
buffers toClArgs
π Reviewer Checklist
β PR Checklist
spatter-devel
π TODOs
π Future Work