chenziwenhaoshuai / Vision-KAN

KAN for Vision Transformer
MIT License
233 stars, 17 forks

Kolmogorov-Arnold Transformer #20

Closed by Adamdad 1 month ago

Adamdad commented 2 months ago

KAN was strong but faced scalability issues. We tackled this with 3 simple tricks. By combining KAN with Transformers, we've built a much stronger and more scalable model. 💪

📄 Paper: https://arxiv.org/abs/2409.10594
💻 Code: https://github.com/Adamdad/kat

chenziwenhaoshuai commented 1 month ago

awesome work!