seeder-research / uMagNUS

Other
6 stars 3 forks source link

Reduction optimization #17

Closed xfong closed 2 years ago

xfong commented 2 years ago

Improve parallelism of reduction kernel to improve speed with as little truncation errors as possible