NAT implementation(Neighborhood Attention Transformer)
This is an unofficial implementation. https://arxiv.org/pdf/2204.07143.pdf
original implemented:https://github.com/SHI-Labs/Neighborhood-Attention-Transformer
pytorch implementation and cuda(use cupy(please use CuPy v10+ and PyTorch 1.11+)) implementation.