Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
1.41k
stars
208
forks
source link
How to run programs on multiple machines, such as 4 machines with 8 gpus(4*8=32 in total)? #46
Closed
raojay7 closed 2 years ago
export MASTER_ADDR=$DIST_0_IP export MASTER_PORT=$DIST_0_PORT export NODE_RANK=$DIST_RANK python run.py with data_root= num_gpus= num_nodes= task_mlm_itm whole_word_masking=True step200k per_gpu_batchsize=
ex) python run.py with data_root=/data2/dsets/dataset num_gpus=8 num_nodes=1 task_mlm_itm whole_word_masking=True step200k per_gpu_batchsize=64
How do I set up these codes and what other operations need to be done?