Closed xfong closed 2 years ago
Improve parallelism of reduction kernel to improve speed with as little truncation errors as possible
Improve parallelism of reduction kernel to improve speed with as little truncation errors as possible