Open YuNaruto opened 3 years ago
@ajtao the path logs/dump_folder/hidden-puffin_2021.04.27_17.17/code is available
Notice that we're cd
-ing into the logs/
directory but the command executed, as defined in scripts/dump_folder.yml
is:
python -m torch.distributed.launch --nproc_per_node=1 train.py
However, train.py
is defined at the project directory level. Changing the relative path to an absolute path should fix this issue. That is:
CMD: "python -m torch.distributed.launch --nproc_per_node=1 {absolute path to train.py}"
In my case this was:
CMD: "python -m torch.distributed.launch --nproc_per_node=1 /home/ubuntu/semantic-segmentation/train.py"
There's something wrong with (both) of your setups. What runx is attempting to do is:
See also #86
when i run this: python -m runx.runx scripts/dump_folder.yml -i , go wrong . but the path is available. how this?