liuliu / ccv

C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Library
http://libccv.org
Other
7.07k stars 1.71k forks source link

Custom Layer Normalization Metal Kernel #253

Closed philipturner closed 1 year ago

philipturner commented 1 year ago

IO-aware exact and approximate normalization. Only applied to layer normalization right now.

Archived draft with the full FlashNorm implementation: