Closed yzh119 closed 1 month ago
The indptr array length should be a upper-bound of batch_size + 1 in cuda graph mode.
indptr
batch_size + 1
The
indptr
array length should be a upper-bound ofbatch_size + 1
in cuda graph mode.