bespoke-silicon-group / bsg_replicant

BSG Replicant: Cosimulation and Emulation Infrastructure for HammerBlade
BSD 3-Clause "New" or "Revised" License
26 stars 20 forks source link

Adding multikernel example #806

Open dpetrisko opened 2 years ago

dpetrisko commented 2 years ago

This test demonstrates launching multiple kernels and synchronizing between them in an extremely primitive way. The generic kernel simply accepts an input/output buffer and a synchronization location. It then modifies the buffer and synchronizes to the barrier when done.

Turns out, this totally just works using the CUDA-lite infrastructure, which is great. But this example will demonstrate to users how to do it, as well as provide regression. Would appreciate any style feedback or comment requests