issues
search
nicknytko
/
numml
MIT License
12
stars
2
forks
source link
Paper-related optimizations
#7
Closed
nicknytko
closed
1 year ago
nicknytko
commented
1 year ago
Speed up construction from COO by using bincount instead of a Python loop
Rewrite CPU SpSpMM forward pass to use the implementation in Scipy and SMMP
Speed up GPU SpSpMM backward pass by using bincount to count row nonzeros instead of using a lazily-implemented kernel