Open cuuupid opened 6 years ago
The default training script for single GPU scripts/train_256_g1.sh
also gives the first error.
Alas, I don't recognize these errors.
I also met this problem when using AWS with K80 GPU. It turns out to be a problem when installing flownet. In the 3 setup.py, the K80 arch is not added, making the complied file not compatible with K80.
Add or change a line into '-gencode', 'arch=compute_37,code=sm_37'
in all 3 setup.py's then install flownet solved the problem.
Refer to https://github.com/NVIDIA/flownet2-pytorch/issues/33
@WangzhiDai I'm currently trying to overcome this problem on Google Colab, which also has a K80 GPU. Did you just have to do that for installing flownet, but not for download_models_flownet2?
@WangzhiDai I'm currently trying to overcome this problem on Google Colab, which also has a K80 GPU. Did you just have to do that for installing flownet, but not for download_models_flownet2?
Did you manage to run code on Google Colab ?
Did anyone try this in colab? I am getting a similar error.
When trying to run Pose2Body using the following setup:
--fg
because it is pose2body so there is no label map or foreground/background separationI get the following error about CUDA:
I thought this was a GPU/CUDA issue, but I am using the AWS Deep Learning Image so everything is setup correctly, and also Torch seems to work fine with CUDA by the below in Python interactive mode:
I tried removing
label_nc 0
, but this gave rise to a bigger error:The following is the network info printed out by
train.py
:Using a Tesla K80 with 12GB, CUDA 9.0, CUDNN 7.0.5, Ubuntu 16.04