Closed LiRunyi2001 closed 1 year ago
Update: I install xformers and re-trained the model with video size 512*512, but this weird error still happens.
please avoid using distributed training on multiple gpus. you may specify one gpu by export CUDA_VISIBLE_DEVICES=GPU_ID
.
please avoid using distributed training on multiple gpus. you may specify one gpu by
export CUDA_VISIBLE_DEVICES=GPU_ID
.
Thanks! I've tried this and it worked well.
Hi there! Due to limited GPU memory size, during training process it will trigger OOM, thus I turned to train with video size 256*256, batch_size=1. However, it leads to error like this:
I don't quite sure what to do with this. Any advise to this issue? Thanks!