Open rosarioscavo opened 3 years ago
I ran into a similar but not exactly the same issue when training with multiple nodes with 1 GPU per node (but not with multigrid). My workaround is to always initialize distributed training by commenting out these two lines
This may help with the change you made
I think the problem was that I tried to use the multigrid with only 1 GPU because with the "standard" file configuration and the "standard" model for charades it worked without any edits.
Hello,
Even i am facing the same problem. I checked in the Multigrid Paper that they have shown results for 1 GPU setting on Kinetics dataset. Any guidance will be helpful. Thank You
i think u can change
sampler = DistributedSampler(dataset) if cfg.NUM_GPUS > 1 else None
to
sampler = DistributedSampler(dataset) if cfg.NUM_GPUS > 1 else RandomSampler(dataset)
it works on my machine
Hi, I downloaded the Charades dataset and tried to train the dataset with the command:
python tools/run_net.py --cfg configs/Charades/SLOWFAST_16x8_R50_multigrid.yaml DATA.PATH_TO_DATA_DIR ../Charades_v1_rgb/
Considering that I've only 1 GPU, I edited the .yaml NUM_GPUS parameter to 1. So the .yaml file configuration is:
Running the command, I get this error:
The sampler is created using the create_sampler function: https://github.com/facebookresearch/SlowFast/blob/fd41618191d3c21c1ca21a61369ce9917646cf9c/slowfast/datasets/loader.py#L87
the create_sampler is defined as follows: https://github.com/facebookresearch/SlowFast/blob/fd41618191d3c21c1ca21a61369ce9917646cf9c/slowfast/datasets/utils.py#L304-L318
I tried to change
to
Doing that the error changed in:
Do you know a way to be able to use SlowFast with only one GPU or am I doing something incorrectly? Thank you!