ICLDisco / parsec

PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core heterogeneous architectures. PaRSEC assigns computation threads to the cores, GPU accelerators, overlaps communications and computations and uses a dynamic, fully-distributed scheduler based on architectural features such as NUMA nodes and algorithmic features such as data reuse.
Other
47 stars 16 forks source link

m, n, and k report 'none' in dtd traces #485

Open DSMishler opened 1 year ago

DSMishler commented 1 year ago

Using PaRSEC commit 7f81a1b (2023-01-19) and DPLASMA commit 75012ef3f (2023-01-23), the m, n, and k data in the trace of a GEMM is not present when using the dtd interface.

To Reproduce

  1. Add the --force-profile flag to PARSEC_PTGPP_FLAGS in parsec/CMakeLists.txt
  2. build command ../dplasma/configure --with-hwloc --with-mpi --with-blas=Intel10_64lp_seq --disable-debug -DPARSEC_PROF_TRACE=ON --prefix=$PWD/install
  3. ./testing_dgemm_dtd -N 3000
  4. turn the produced trace into an h5 file
  5. check on the trace in your favorite software
trace_xsdk_dtd_lfq_2400_reforce.event_types[-5:]

MPI_DATA_PLD_SND         18
MPI_DATA_PLD_RCV         19
parsec_dtd_data_flush    20
Gemm                     21
Fake_FIRST_OUT           22

tevents = trace_xsdk_dtd_lfq_2400_reforce.events
tevents.loc[tevents['type'] == 21].head()

  | dc_key | priority | dc_dataid | tcid | trc | m | n | node_id | stream_id | taskpool_id | type | begin | end | flags | id | k
-- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | -- | --
0 | 0 | 24 | 1 | 0 | None | None | 0 | 0 | 1 | 21 | 17248169 | 26838685 | 1 | 60 | None
0 | 0 | 24 | 1 | 0 | None | None | 0 | 0 | 1 | 21 | 26844096 | 27464702 | 1 | 62 | None
0 | 0 | 24 | 1 | 0 | None | None | 0 | 0 | 1 | 21 | 27467691 | 27778542 | 1 | 64 | None
0 | 0 | 24 | 1 | 0 | None | None | 0 | 0 | 1 | 21 | 27781289 | 28090761 | 1 | 66 | None
0 | 0 | 24 | 1 | 0 | None | None | 0 | 0 | 1 | 21 | 28093617 | 28354461 | 1 | 68 | None

</div>
DSMishler commented 1 year ago

Tried this on the most recent version of parsec (f8864b52b) and the issue persists

bosilca commented 1 year ago

Please try again now that #488 has been merged.

DSMishler commented 1 year ago

Issue still there for some reason - checked on commit 1eb84c2a8