LuxDL / LuxLib.jl

Backend for Lux.jl
MIT License
9 stars 0 forks

test: improved batched matmul & LV handling #121

Closed avik-pal closed 3 months ago

avik-pal commented 3 months ago

This gives a massive performance boost for small matrices with large batch sizes, while performance eventually matches that of batched_mul at larger matrix sizes, where its strategy is good enough.
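For readers unfamiliar with the operation, here is a minimal sketch of what a batched matrix multiply computes in the "small matrices, large batch" regime described above. It is not the implementation from this PR: `naive_batched_matmul` is a hypothetical helper written for illustration, it uses only the `LinearAlgebra` standard library, and it assumes the batch index is the third array dimension.

```julia
using LinearAlgebra

# Hypothetical helper (not part of LuxLib.jl): multiplies x[:, :, k] by y[:, :, k]
# for every batch index k. Assumes the batch dimension is the third one.
function naive_batched_matmul(x::AbstractArray{T,3}, y::AbstractArray{T,3}) where {T}
    size(x, 2) == size(y, 1) || throw(DimensionMismatch("inner dimensions must match"))
    size(x, 3) == size(y, 3) || throw(DimensionMismatch("batch sizes must match"))
    z = similar(x, size(x, 1), size(y, 2), size(x, 3))
    for k in axes(x, 3)
        # mul! writes the product directly into the preallocated output slice
        mul!(view(z, :, :, k), view(x, :, :, k), view(y, :, :, k))
    end
    return z
end

# Many small matrices: 128 independent (4x3) * (3x5) products
x = rand(Float32, 4, 3, 128)
y = rand(Float32, 3, 5, 128)
z = naive_batched_matmul(x, y)
```

With matrices this small, per-slice call overhead can account for a large share of the runtime, which is presumably why a kernel specialized for small matrices pays off in this regime.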

Features

Fixes

Tests