Closed JoseMoFi closed 2 years ago
Hi, @JoseMoFi , it seems like its about your environment setting, because the error occurs in the forward of ResNet. Perhaps you can check your envionment first and then an input validation may be helpful.
I use WSL 2, could it be the problem? And thank you for the help!
I'm not familar with WSL 2, all experiments are conducted on ubuntu. Can WSL 2 detect the GPU device?
Yes, WSL 2 can detect the GPU device. However, I think the problem should be WSL 2 because I had similar error in other repo when I was training and now I test again but in W10 and it work, so... I'll do more test, but it is very probable who the problem must be WSL 2 or some config. If I find something I'll post here. And really thank you for the help!
@JoseMoFi I suggest you go straight install Ubuntu
rather than wasting your time to set this up on W10
(been there myself & I ended up installing Ubuntu 😢)
This code works well on Ubuntu, even on the Nvidia DGX-1
environment ✌🏼
Ok, I am secure that the problem was WSL 2. However, I don't know if it's because I have bad config CUDA or if WSL can't work with the graphic card. But I use other code that neither work in WSL but it can work on server with Ubuntu. So I can say thay my problem is caused by WSL. Thank you for the help!
Hello, I'm replicating this model but when I execute the command for do the inferece an unknowns error appears. However, I don't know why I have this error. My setup it's:
The complete error is:
And I have change the config file: -batch_size: 2 +batch_size: 1 -test_batch_size: 8 -num_worker: 10 -device: 0,1,2 +test_batch_size: 1 +num_worker: 1 +device: 0
Also my torch version its
1.8.1+cu111
Thank you for the help!
UPDATE
Also i found this error:
whit the next config -batch_size: 2 +batch_size: 1 random_seed: 0 -test_batch_size: 8 -num_worker: 10 -device: 0,1,2 +test_batch_size: 2 +num_worker: 2 +device: 0