bespoke-silicon-group / bsg_manycore

Tile based architecture designed for computing efficiency, scalability and generality

Updating manycore for cache dma #622

Closed: dpetrisko closed 2 years ago

dpetrisko commented 2 years ago

Corresponds to https://github.com/bespoke-silicon-group/basejump_stl/pull/410

tommydcjung commented 2 years ago

I would say launch a CI branch to make sure that all existing configurations pass.

dpetrisko commented 2 years ago

@tommydcjung I launched a CI run, but it failed because the basejump PR has not been merged yet. What would you suggest in this situation?

tommydcjung commented 2 years ago

I see. In this case, the regression needs to be run locally, and then both PRs need to be approved and merged. I can help with that. @drichmond can you run some regressions on cuda-lite to validate that the hbm2 dma mapping is not affected?

drichmond commented 2 years ago

yeah, which PRs need to be run?

tommydcjung commented 2 years ago

Found bug with pod_1x1_hbm2

Error-[EEST] $error elaboration system task
  msg: WH len width 4 must be large enough to hold the dma transfer size 5
  location: file
  //basejump_stl/bsg_cache/bsg_wormhole_to_cache_dma_fanout.v
  line 363
  path: spmd_testbench.tb.hbm2.hs[1].py[0].py[3].row[1].rf[0].wh_to_dma

dpetrisko commented 2 years ago

Thanks, fixed the bug. The DMA length was calculated incorrectly, and the wider HBM channels exposed the error.
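
To illustrate the kind of check that produced the elaboration error above, here is a minimal sketch in Python. It assumes the message compares the width of a length field (4 bits) against the number of bits needed to encode the largest transfer length (5 bits). That reading is an assumption, and the actual condition in `bsg_wormhole_to_cache_dma_fanout.v` is not shown in this thread. The names `bits_to_hold` and `check_len_width` are hypothetical and do not come from basejump_stl.

```python
def bits_to_hold(value: int) -> int:
    """Return the number of bits needed to represent an unsigned integer."""
    if value < 0:
        raise ValueError("value must be non-negative")
    # int.bit_length() returns 0 for 0, so clamp to at least one bit.
    return max(1, value.bit_length())


def check_len_width(len_width: int, max_len: int) -> None:
    """Raise ValueError if a len_width-bit field cannot encode max_len.

    Hypothetical stand-in for an elaboration-time assertion; the real
    check may measure the transfer size differently.
    """
    required = bits_to_hold(max_len)
    if len_width < required:
        raise ValueError(
            f"len width {len_width} must be large enough to hold "
            f"a length of {max_len} (needs {required} bits)"
        )
```

Under this assumption, a 4-bit field accepts lengths up to 15, so `check_len_width(4, 15)` passes while `check_len_width(4, 16)` raises, because 16 needs 5 bits. A length that grows with the channel configuration can cross such a boundary, which is one way a miscalculated length could go unnoticed until a wider configuration is tested.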

tommydcjung commented 2 years ago

@drichmond can you run some regression with this PR and https://github.com/bespoke-silicon-group/basejump_stl/pull/410?

drichmond commented 2 years ago

Yeah, seems to run fine once https://github.com/bespoke-silicon-group/bsg_replicant/pull/764 is applied.

This is just a refactor, not a functionality change, right? No memory mapping changes?

tommydcjung commented 2 years ago

I need to look into why interpod_memory_test is hanging for pod_4x4_hbm2. It's not caused by this PR; it was already hanging before.

tommydcjung commented 2 years ago

Made a fix here: https://github.com/bespoke-silicon-group/bsg_manycore/pull/626