Closed ArmenLevoni closed 4 years ago
@ArmenLevoni This looks like a problem with pytorch, where even though we aren't using GPU, it still fails because CUDA is too old. One fix is to update your CUDA drivers or uninstall CUDA (use the non-CUDA docker). But pytorch also shouldn't be checking the CUDA driver when gpu mode is turned off, so you could verify that the pytorch examples fail and submit a bug to them if you want to chase it down: https://github.com/pytorch/examples/blob/master/mnist/main.py
Whoops, clicked the wrong button. Feel free to reply or close if the issue is done, thanks.
Thanks, @MisterTea, I tried to update CUDA drivers, I build both versions docker docker/cpu.Dockerfile and cuda.Dockerfile, results are the same. I run these on MacOs Mojave. Pytoarch example does not work:
from torchvision import datasets, transforms Traceback (most recent call last): File "
", line 1, in ModuleNotFoundError: No module named 'torchvision'
Actually after installing: conda install torchvision it updates pytorch to: pytorch-1.0.1 |py3.6_cuda9.0.176_cudnn7.4.2_2 320.5 MB pytorch
and it starts working. Most probably pytorch-nightly (from requirements.txt) is broken.
Later on, with pytorch-1.0.1 version, it fails in this stage: ~/Horizon/Horizon# python ml/rl/workflow/dqn_workflow.py -p ml/rl/workflow/sample_configs/discrete_action/dqn_example.json getting error:
File "/home/miniconda/lib/python3.6/site-packages/torch/utils/data/_utils/worker.py", line 11, in <module>
from torch._six import queue
ImportError: cannot import name 'queue'
In the: https://github.com/facebookresearch/Horizon/blob/master/requirements.txt can pytorch-nightly be replaced with a correct version that the provided examples will work with?
Let me see if I can keep pytorch up to date in our integration tests so we can catch these errors...
I followed instructions from here: https://github.com/facebookresearch/Horizon/blob/master/docs/installation.md to run Docker image on Mac. However when I am running the example, getting following error:
How can I configure to run the example on the CPU?