Closed vinx13 closed 1 week ago
This makes the allocation go through memory planning and make it compatible with cuda graph.
cc @yongwww @masahi
@vinx13 please take a look at lint
This makes the allocation go through memory planning and make it compatible with cuda graph.
cc @yongwww @masahi