Closed siebeniris closed 11 months ago
Hi. I don't think this is an error with vec2text. It looks like either (1) you're launching DDP with only one GPU [via torchrun?] or (2) that print statement happens from within the training process, which in your setting should be executed in two concurrent processes each with one GPU.
Hi,
I think I did a mistake by using setting find_unused_parameters=True
. DDP does work well for precomputing hypotheses.
Thanks!
Hi, I would love to make the DDP work for training. But I directly got this warning by running the program:
The following is my bash script for running DDP in cloud with singularity container, is there something I should have setup but I didn't ? Any help would be appreciated.
--gres=gpu:a40:2
is set using singularity container.