Xilinx / ACCL

Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators
https://accl.readthedocs.io/
Apache License 2.0

Warning for ACCL operations on too large data buffers #107

Closed TristanLaan closed 1 year ago

TristanLaan commented 2 years ago

The behaviour of ACCL is undefined when an operation exchanges more data than fits in the rx buffers. This mistake is easy to overlook, and it causes the CCLO to hang without a clear cause. It would be helpful if the ACCL driver issued a warning when asked to perform an operation that is too large to fit in the rx buffers.
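
To illustrate the kind of check being requested, here is a minimal sketch in Python. It is not ACCL code: the function name `fits_rx_buffer` and its parameters (`count`, `elem_bytes`, `rx_buffer_bytes`) are hypothetical, and it assumes the driver knows the element count, the element size, and the size of a single rx buffer before it starts the operation.

```python
import warnings


def fits_rx_buffer(count: int, elem_bytes: int, rx_buffer_bytes: int) -> bool:
    """Hypothetical pre-flight check, not part of the ACCL API.

    Returns True if a transfer of `count` elements of `elem_bytes` bytes
    each fits in one rx buffer of `rx_buffer_bytes` bytes. Otherwise it
    emits a RuntimeWarning and returns False.
    """
    if count < 0 or elem_bytes <= 0 or rx_buffer_bytes <= 0:
        raise ValueError("count must be >= 0 and both sizes must be positive")

    transfer_bytes = count * elem_bytes
    if transfer_bytes > rx_buffer_bytes:
        # Warn instead of raising, so the caller decides whether to proceed.
        warnings.warn(
            f"transfer of {transfer_bytes} bytes exceeds the rx buffer size "
            f"of {rx_buffer_bytes} bytes; the operation may hang",
            RuntimeWarning,
            stacklevel=2,
        )
        return False
    return True
```

A driver could call such a check at the start of each send, receive, or collective, for example `fits_rx_buffer(count=1024, elem_bytes=4, rx_buffer_bytes=1024)`, which would warn because 4096 bytes do not fit in a 1024-byte buffer.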