Closed ifeherva closed 1 year ago
Thanks for this! This framework looks easy to use but it doesn't support depthwise or grouped convolution, while many CNNs rely on them (resnext, convnext, mobilenet, etc.). We are still using masked conv for now since we found no framework currently has a sufficiently efficient implementation (i.e., faster than masked conv in torch) of sparse depthwise/grouped convolution on GPUs.
How about torch sparse++?
This repo is using masked dense convolutions because it is optimized in torch. However, would this implementation speed things up or too complicated getting this work here?