Closed Mostraet closed 8 months ago
I have also encountered the same problem. Have you resolved it?
I have also encountered the same problem. Have you resolved it?
No, tried again today and couldn't get it to work, same error.
Got the same issue on 21.8.8, it just randomly stopped working, and throwing this error. I downloaded a new setup of kohya with version 22.0.1 which don't throw this error. I still would like to get 21.8.8 working to double check if its supposed to use 4 times as much vram when training XL then 1.5.
It's the latest windows update. I have 2 systems with the exact same config. They are used for SDXL training for months now under the 21.8.5 commit. One of my system got the new windows update today and I got this error right after the other one didn't receive the update and is still working fine.
I have the same issue.
Getting a new setup of kohya don't show the error, but my existing setup still had issues, so i redownloaded python and the error went away. No clue what could have gone wrong.
I solved it!
The reason for this problem is because the path to default_config.yaml is incorrect.
I found that the default_config.yaml referenced here is in C:{Users}\AppData\Local\huggingface\accelerate\default_config .yaml, (This is my case,You guys can see for yourselves where your default_config.yaml file is at)
open this file, comment out the debug parameter, then the problem is solved.
@Mostraet @tuwonga @Liyu96sc
thanks a lot! I got a new fresh setup and solved but this fix is awesome!
I tried pip install --upgrade accelerate
and it worked
I solved it!
The reason for this problem is because the path to default_config.yaml is incorrect.
I found that the default_config.yaml referenced here is in C:{Users}\AppData\Local\huggingface\accelerate\default_config .yaml, (This is my case,You guys can see for yourselves where your default_config.yaml file is at)
open this file, comment out the debug parameter, then the problem is solved.
@Mostraet @tuwonga @Liyu96sc
Thanks for your method! Can u provide some insights about how u find the method? It is not straightforward
Trying to train a LoRA on Linux Mint with nVidia GPU and 16 gigs of RAM, and I get the following error about debugging. I've run accelerate config, but that didn't help. Please help.
Full error log below.