megvii-research / RevCol

Official Code of Paper "Reversible Column Networks" "RevColv2"
Apache License 2.0
249 stars 10 forks source link

'tools/dist_train.sh' #10

Closed 836469383 closed 1 year ago

836469383 commented 1 year ago

First of all, I would say I think this is great work and I am very very interested. Then, I want to debug the segmentation task in the repository. But I didn't find the 'tools/dist_train.sh' when I followed the README.md. Any help will be appreciated.

nightsnack commented 1 year ago

Hi, just like that. I will commit it to the repo later.

CONFIG=$1
GPUS=$2
NNODES=${NNODES:-1}
NODE_RANK=${NODE_RANK:-0}
PORT=${PORT:-29500}
MASTER_ADDR=${MASTER_ADDR:-"127.0.0.1"}

torchrun \
    --nnodes=$NNODES \
    --master_addr=$MASTER_ADDR \
    --nproc_per_node=$GPUS \
    --master_port=$PORT \
    $(dirname "$0")/train.py \
    $CONFIG \
    --launcher pytorch ${@:3}
836469383 commented 1 year ago

Thanks for your answer. Best wishes.

836469383 commented 1 year ago

oh, when will you update the repo 030

nightsnack commented 1 year ago

oh, when will you update the repo 030

Now.

836469383 commented 1 year ago

♥ & starred