Open KopiSoftware opened 4 years ago
I have found the solution. Simply comment out non-existent config keys, which seems useless in this step. Here are the prove of my solution: The error still exists while the key is added in training command(displayed below). So I comment all these keys(3 altogether) and it works.
python -m torch.distributed.launch \
> --nproc_per_node=1 \
> tools/train_net.py \
> --config-file configs/fcos/fcos_imprv_R_50_FPN_1x.yaml \
> MODEL.FCOS.NORM_REG_TARGETS True
> DATALOADER.NUM_WORKERS 1 \
> OUTPUT_DIR training_dir/fcos_imprv_R_50_FPN_1x
Well,I think I made a mistake. The reason why the problem appears is that I downloaded the code from git while compiled and installed fcos with Tsinghua source. However the code from the source ain't the latest. So even if you commen out thses keys, it still doesn't work.
python setup.py build_ext --inplace
and python setup.py build_ext install
My GPU is 1060 6G version and I installed pytorch1.4. Should I return to torch1.0.1 to avoid this problem?