Closed Entretoize closed 8 months ago
Of course it's not. Tons of people have broken training and slow speeds like this, and nobody has an answer. It only happens with SDXL, which is supposed to be easy to train.
Not a bug, your parameters are using more VRAM than your GPU has, so the training script starts using regular RAM as well, which makes the training incredibly slow. Review your parameters, optimizer, etc.
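To check whether a run is close to the dedicated VRAM limit, you can poll `nvidia-smi` while training is going. Below is a minimal sketch, assuming `nvidia-smi` is on your PATH. The helper names and the 90% threshold are my own illustrative choices, not anything from kohya. Note that this only reports dedicated memory, so a near-full reading is a hint that spilling into shared RAM may be happening, not proof.

```python
import subprocess


def parse_gpu_memory(csv_text):
    """Parse 'used, total' lines (in MiB) from nvidia-smi CSV output."""
    readings = []
    for line in csv_text.strip().splitlines():
        used, total = (int(part.strip()) for part in line.split(","))
        readings.append((used, total))
    return readings


def is_near_limit(used_mib, total_mib, threshold=0.9):
    """Return True when dedicated VRAM use is at or above the threshold.

    The 0.9 default is an arbitrary illustrative value.
    """
    return used_mib / total_mib >= threshold


def query_gpu_memory():
    """Ask nvidia-smi for used and total memory, one (used, total) pair per GPU."""
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_memory(result.stdout)
```

Call `query_gpu_memory()` from a second terminal while training runs. If `is_near_limit(used, total)` is true for your card, lowering batch size or resolution, or switching optimizer, is worth trying before anything else.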
After applying all the optimizations listed on the kohya website, I'm able to keep VRAM usage below 9GB and training runs at 7s/it, so it seems you were right. Thanks!
@Entretoize go to https://www.nvidia.com/en-us/geforce/drivers, select your GPU and downgrade to version 531. That should fix your issue. I had a newer version before and was getting around 6-12 s/it; now it's down to about 1.2-0.9 s/it.
I can't confirm, I'm still at 7s/it with version 531. Maybe it depends on which RTX you have...
Downgrading to 531 did it for me: I went from 12.4s/it to 1.9s/it. However, 531 is now only available under Studio drivers.
Version 531 also worked for my RTX 3090. Went from 12s/it to 1.3s/it. Thanks for the tip!
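A note on units, since they are easy to mix up: s/it is seconds per iteration (lower is better) and it/s is iterations per second (higher is better). One is the reciprocal of the other. A small sketch for converting and for estimating how long a run will take; the function names and the step count in the usage note are hypothetical.

```python
def seconds_per_it_to_its_per_second(seconds_per_it):
    """Convert s/it to it/s; the two are reciprocals."""
    return 1.0 / seconds_per_it


def estimated_hours(total_steps, seconds_per_it):
    """Rough wall-clock estimate for a run, ignoring startup and saving time."""
    return total_steps * seconds_per_it / 3600.0
```

For example, a hypothetical 3000-step run is `estimated_hours(3000, 12.0)` hours at 12 s/it versus `estimated_hours(3000, 1.3)` hours at 1.3 s/it, which is why the slowdown discussed here matters so much in practice.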
I followed a tutorial to train a LoRA model with Kohya for SDXL. The best I can get is 73s/it, which seems slow, but maybe it's normal? I already tried another tutorial for SD1.5 and it was fast, so I think there is an issue.
Here's my cmd history:
I have an RTX 3080 GPU (10GB dedicated + 16GB shared). While training, it uses all of the dedicated memory plus 2.5GB of the shared memory, but from my other tests with SD1.5 that doesn't seem to be a problem.
From what I read, speed should be 1 or 2s/it. I also read about someone who reinstalled some package and solved this problem, but which package should I reinstall? Others said they had this kind of issue with the latest version. I'm confused...