Blealtan / efficient-kan

An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
MIT License

KANs are MLPs? MLP Equivalent #8

Closed hypnopump closed 1 month ago

hypnopump commented 1 month ago

This note shows that a KAN is equivalent to an MLP in the piecewise-linear approximation. I guess the non-linearity of the splines might help in some cases, but it would be cool to have the MLP equivalent as a baseline. Here's the Reddit discussion.

minh-nguyenhoang commented 1 month ago

Only if you are using B-splines of order 1 (i.e. piecewise-linear splines).
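
To make the order-1 case concrete, here is a minimal sketch in plain Python. It assumes uniformly spaced knots with spacing `h`; the function names are hypothetical and not taken from this repository. On such a grid, each piecewise-linear basis function (a "hat") is a fixed linear combination of three ReLUs, so a spline built from hats can be rewritten as a one-hidden-layer ReLU network.

```python
def relu(z):
    return max(z, 0.0)


def hat(x, t, h):
    """Piecewise-linear B-spline basis ("hat") centred at knot t, spacing h."""
    return max(0.0, 1.0 - abs(x - t) / h)


def hat_from_relus(x, t, h):
    """The same hat written as a fixed linear combination of three ReLUs."""
    return (relu(x - (t - h)) - 2.0 * relu(x - t) + relu(x - (t + h))) / h


def piecewise_linear_spline(x, coeffs, knots, h):
    """Spline = learned coefficients times hat basis functions."""
    return sum(c * hat(x, t, h) for c, t in zip(coeffs, knots))


def relu_network(x, coeffs, knots, h):
    """Same function, expressed only through ReLU units with shifted inputs."""
    return sum(c * hat_from_relus(x, t, h) for c, t in zip(coeffs, knots))


if __name__ == "__main__":
    h = 0.5
    knots = [-1.0, -0.5, 0.0, 0.5, 1.0]
    coeffs = [0.3, -1.2, 0.7, 2.0, -0.4]  # arbitrary illustrative values
    for x in (-0.8, -0.1, 0.33, 0.9):
        print(x, piecewise_linear_spline(x, coeffs, knots, h),
              relu_network(x, coeffs, knots, h))
```

For higher-order B-splines the basis functions are piecewise polynomials rather than piecewise linear, so this ReLU rewrite no longer applies exactly.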

Blealtan commented 1 month ago

It's equivalent to activating the same hidden state with multiple activation functions and then using a wider linear transformation to shrink it back. It's somewhat like a Gated Linear Unit, but in reverse: the linear transformation comes after the broadening activation.
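
A rough sketch of that view in plain Python, not the code in this repository: the three functions in `ACTIVATIONS` are hypothetical stand-ins for the spline basis functions, and all names are made up for illustration. Each of the `n` inputs is passed through `K` fixed activations, giving `n * K` values, and a single linear map with a weight matrix of shape `(out, n * K)` shrinks them back. Grouping the same weights per input recovers the "learned function on every edge" picture of a KAN.

```python
import math


def silu(z):
    return z / (1.0 + math.exp(-z))


# Hypothetical stand-ins for the K basis functions applied to each input.
ACTIVATIONS = [silu, math.tanh, math.sin]


def broaden(x):
    """Apply every activation to every input: n values -> n * K values."""
    return [phi(xi) for xi in x for phi in ACTIVATIONS]


def linear(W, v):
    """Plain matrix-vector product; W is a list of rows."""
    return [sum(w * a for w, a in zip(row, v)) for row in W]


def broaden_then_linear(x, W):
    """Broadening activation first, then the wider linear transformation."""
    return linear(W, broaden(x))


def edge_function_form(x, W):
    """Same computation, written as a sum of per-edge learned functions."""
    K = len(ACTIVATIONS)
    out = []
    for row in W:
        total = 0.0
        for i, xi in enumerate(x):
            # Edge function for input i: a weighted sum of the K activations.
            total += sum(row[i * K + k] * ACTIVATIONS[k](xi) for k in range(K))
        out.append(total)
    return out
```

Both functions compute the same output up to floating-point rounding; they differ only in how the weights are grouped.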