Implemented an abstract base class and a derived template class for connections. Implemented two template specializations (12 bytes and 16 bytes). Reduced GPU memory overhead for remote connection creation. Improved the MPI remote connection memory checker with an automatic check and a summary of the results.
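A minimal sketch of how an abstract base class, a derived template class, and two specializations for 12-byte and 16-byte connection records could fit together. All names here (`ConnectionBase`, `ConnectionTemplate`, `Conn12`, `Conn16`, `makeConnections`), the field layout, and the choice to drop the delay field in the 12-byte variant are illustrative assumptions, not the identifiers or layout used in this change. Host-side `std::vector` storage stands in for GPU memory.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical record layouts; the real fields are not stated in the source.
struct Conn12 { std::uint32_t source; std::uint32_t target; float weight; };
struct Conn16 { std::uint32_t source; std::uint32_t target; float weight; float delay; };

static_assert(sizeof(Conn12) == 12, "expected a 12-byte connection record");
static_assert(sizeof(Conn16) == 16, "expected a 16-byte connection record");

// Abstract interface: callers do not need to know the record size.
class ConnectionBase {
public:
    virtual ~ConnectionBase() = default;
    virtual std::size_t bytesPerConnection() const = 0;
    virtual std::size_t count() const = 0;
    virtual void add(std::uint32_t source, std::uint32_t target,
                     float weight, float delay) = 0;
    virtual float delayOf(std::size_t i) const = 0;
    std::size_t totalBytes() const { return bytesPerConnection() * count(); }
};

// Derived template: shared logic is written once for any record type.
template <typename ConnT>
class ConnectionTemplate : public ConnectionBase {
public:
    std::size_t bytesPerConnection() const override { return sizeof(ConnT); }
    std::size_t count() const override { return conns_.size(); }
    // Record-specific members, defined only through the specializations below.
    void add(std::uint32_t source, std::uint32_t target,
             float weight, float delay) override;
    float delayOf(std::size_t i) const override;
private:
    std::vector<ConnT> conns_;
};

// 12-byte specialization: the delay is not stored (assumed fixed at 1.0).
template <>
void ConnectionTemplate<Conn12>::add(std::uint32_t source, std::uint32_t target,
                                     float weight, float /*delay*/) {
    conns_.push_back(Conn12{source, target, weight});
}
template <>
float ConnectionTemplate<Conn12>::delayOf(std::size_t i) const {
    assert(i < conns_.size());
    return 1.0f;
}

// 16-byte specialization: the delay is stored per connection.
template <>
void ConnectionTemplate<Conn16>::add(std::uint32_t source, std::uint32_t target,
                                     float weight, float delay) {
    conns_.push_back(Conn16{source, target, weight, delay});
}
template <>
float ConnectionTemplate<Conn16>::delayOf(std::size_t i) const {
    assert(i < conns_.size());
    return conns_[i].delay;
}

// Hypothetical factory: pick the record size once, use the base interface after.
std::unique_ptr<ConnectionBase> makeConnections(bool storeDelay) {
    if (storeDelay) {
        return std::make_unique<ConnectionTemplate<Conn16>>();
    }
    return std::make_unique<ConnectionTemplate<Conn12>>();
}
```

With this shape, code that creates or inspects connections works against `ConnectionBase`, and the smaller record can be selected when the extra field is not needed, which is one plausible way a per-connection memory saving could be obtained.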