Xilinx / ACCL

Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators
https://accl.readthedocs.io/
Apache License 2.0
81 stars 26 forks source link

Copy fails with Coyote implementation #167

Closed mar-ven closed 5 months ago

mar-ven commented 10 months ago

With the Coyote implementation, copy seems not to work unless the src and dst buffers are both synced before invoking the copy() call, via sync_to_device(). In particular, it happens that the content for a float32 dst buffer after copy appears full of 0s, so no operation performed on it, unless the sync_to_device() is performed before and after the copy() call.