mindspore-lab / mindone

one for all, Optimal generator with No Exception
Apache License 2.0

Improve STDiT training: new FA, VAE caching, optimized QKV split-transpose order #459

Closed: SamitHuang closed this 2 months ago

SamitHuang commented 2 months ago

What does this PR do?

Fixes # (issue) Training with flash attention (FA); enables 512x512x32 training with RC+FA. The new FA requires a MindSpore 2.3 build dated 20240422 or later.

Adds # (feature) VAE caching
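
The description does not show how the VAE cache is implemented. As a rough illustration of the general idea only (encode each sample once with the frozen VAE, store the latents, and reuse them in later epochs), here is a minimal stdlib sketch. `LatentCache`, `fake_vae_encode`, and the on-disk layout are hypothetical names chosen for this example and are not taken from mindone.

```python
import hashlib
import pickle
import tempfile
from pathlib import Path


def fake_vae_encode(frames):
    """Hypothetical stand-in for a frozen VAE encoder: one mean value per frame."""
    return [sum(frame) / len(frame) for frame in frames]


class LatentCache:
    """Stores encoder outputs on disk so each sample is encoded only once."""

    def __init__(self, cache_dir, encode_fn):
        self.cache_dir = Path(cache_dir)
        self.cache_dir.mkdir(parents=True, exist_ok=True)
        self.encode_fn = encode_fn
        self.encode_calls = 0  # number of cache misses, kept for illustration

    def _path(self, sample_id):
        # Hash the id so arbitrary strings map to safe file names.
        digest = hashlib.sha1(sample_id.encode("utf-8")).hexdigest()
        return self.cache_dir / f"{digest}.pkl"

    def get(self, sample_id, frames):
        path = self._path(sample_id)
        if path.exists():
            # Cache hit: skip the encoder entirely.
            with path.open("rb") as f:
                return pickle.load(f)
        # Cache miss: run the encoder once and persist the result.
        latents = self.encode_fn(frames)
        self.encode_calls += 1
        with path.open("wb") as f:
            pickle.dump(latents, f)
        return latents


# Usage: the second lookup of the same sample is served from disk.
with tempfile.TemporaryDirectory() as tmp_dir:
    cache = LatentCache(tmp_dir, fake_vae_encode)
    clip = [[1.0, 3.0], [2.0, 4.0]]
    first = cache.get("clip_0", clip)
    second = cache.get("clip_0", clip)
    print(first == second, cache.encode_calls)
```

Caching like this only makes sense when the encoder is frozen and any augmentation applied before encoding is deterministic; otherwise the stored latents would no longer match what the encoder would produce.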

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.
