Closed by fake-warrior8 2 years ago
Nodes are machines, so "node0>>" and "node1>>" just indicate which machine each command is run on. Most machines have at most 4 or 8 GPUs. If you want to train on more than that, you'll have to use multiple machines, in which case you need to specify the IP address of the master node (in this case, node0).
If you want to run on a single machine (e.g. 4 or 8 GPUs), then yes, you can simply run:
>> python main-avid.py configs/main/avid/kinetics/Cross-N1024.yaml --dist-url tcp://localhost:1234 --multiprocessing-distributed --world-size 1 --rank 0
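For the multi-machine case, here is a rough sketch of what the per-node commands would probably look like. It only reuses the flags from the single-machine command above; `launch_cmd` is a hypothetical helper, `192.0.2.10` is a placeholder for node0's IP address, and I'm assuming `--world-size` counts machines and `--rank` is the machine index (as the single-machine command with `--world-size 1 --rank 0` suggests). The helper prints the commands rather than running them.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): prints the launch
# command for one machine. Arguments: node rank, number of nodes, master IP.
launch_cmd() {
  local rank="$1" world_size="$2" master_ip="$3"
  # Assumption: --world-size is the number of machines and --rank is the
  # index of this machine, with node0 (rank 0) acting as the master.
  echo "python main-avid.py configs/main/avid/kinetics/Cross-N1024.yaml" \
       "--dist-url tcp://${master_ip}:1234" \
       "--multiprocessing-distributed --world-size ${world_size} --rank ${rank}"
}

# Two machines: run the first printed command on node0, the second on node1.
# 192.0.2.10 is a placeholder; replace it with node0's real IP address.
launch_cmd 0 2 192.0.2.10
launch_cmd 1 2 192.0.2.10
```

Every machine points `--dist-url` at the same master address; only `--rank` differs between them.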
Thank you!
Hi, could you tell me what "node0>>" and "node1>>" mean in the run commands? I can't find any explanation of "node0>>" on Google or in the documentation you linked. That documentation only gives an example of
So can I just remove the "node k >>" prefix?