DeepLink-org / DeepLinkExt

BSD 3-Clause "New" or "Revised" License

feat: wx support adamw and fix fallback of varlen flash attention #113

Status: Closed (closed by POI-WX 3 months ago)

POI-WX commented 3 months ago

Adds AdamW support and fixes the fallback path for varlen (variable-length) flash attention.