Open nizu-alt opened 6 months ago
Same error when training with Chinese. Win 10, CUDA 12.1, RTX 3090
[Training] [2024-05-28T17:33:02.323483] C:\actions-runner_work\pytorch\pytorch\builder\windows\pytorch\aten\src\ATen\native\cuda\Indexing.cu:1289: block: [144,0,0], thread: [95,0,0] Assertion srcIndex < srcSelectDimSize
failed.
[Training] [2024-05-28T17:33:03.025133] Disabled distributed training.
[Training] [2024-05-28T17:33:03.025133] Path already exists. Rename it to [./training\lt\finetune_archived_240528-173133]
[Training] [2024-05-28T17:33:03.026133] Loading from ./models/tortoise/dvae.pth
[Training] [2024-05-28T17:33:03.027132] Traceback (most recent call last):
[Training] [2024-05-28T17:33:03.028131] File "D:\ai-voice-cloning\src\train.py", line 72, in TORCH_USE_CUDA_DSA
to enable device-side assertions.
[Training] [2024-05-28T17:33:03.066119]
Anyone figured out how to fix this error? I'm training Korean and faced the exact same error message
Possible latent mismatch: click the "(Re)Compute Voice Latents" button and then try again. Error: CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1. Compile with
TORCH_USE_CUDA_DSA
to enable device-side assertions.