Closed radarFudan closed 11 months ago
I've updated the README.
The depthwise conv1d implemented here is equivalent to 1 line of Pytorch (see README), so you can always just call Pytorch for the cases not supported here. The goal of this repo is just to make it fast for some cases used in Mamba (and a few other model architectures).
Hello! Thank you so much for this helpful repo for mamba.
Questions: