Haidra-Org / horde-worker-reGen

The default client software to create images for the AI-Horde
https://aihorde.net/
GNU Affero General Public License v3.0

Flash Triton #333

HPPinata closed 2 weeks ago

HPPinata commented 3 weeks ago

A newer (and less janky) version of flash_attn.

A bit more testing is required around changing the PyTorch version to 2.5.0 without breaking older setups.

Potential improvements:

HPPinata commented 2 weeks ago

This might still need a bit more work. Hold on for now.

HPPinata commented 2 weeks ago

@tazlin This should now be fine to merge. Performance is as expected, memory usage is slightly better than the current implementation, and conda performance is roughly in line with the Docker version. Stability is also improved: I'm getting a <1% process recovery rate.

Once support is merged upstream, this will get another minor rework, but that might be months off.