Closed djl11 closed 6 years ago
Many thanks! I changed the code as suggested. Sorry for taking so long to respond - I am very busy with new projects.
@simonmeister @djl11
Hi,
Thanks for help in this issue.
It seems that I still face this problem (cuda related) now. My settings are: cuda 8.0 (I work on server, which means cuda is installed in shared path), tensorflow-gpu 1.10.
My errors are as followings, I very appreciate your help if you have some time.
nvcc warning : The 'compute_20', 'sm_20', and 'sm_21' architectures are deprecated, and may be removed in a future release (Use -Wno-deprecated-gpu-targets to suppress warning).
In file included from /home/runzeli/anaconda3/lib/python3.5/site-packages/tensorflow/include/tensorflow/core/util/cuda_kernel_helper.h:21:0,
from backward_warp_op.cu.cc:8:
/home/runzeli/anaconda3/lib/python3.5/site-packages/tensorflow/include/tensorflow/core/util/cuda_device_functions.h:32:31: fatal error: cuda/include/cuda.h: No such file or directory
compilation terminated.
Traceback (most recent call last):
File "/data2/Runze/UnFlow/src/e2eflow/ops.py", line 59, in
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "run.py", line 7, in
it has been solved, thank. :)
@bragilee Excuse me, could you tell me how to solve this issue? I have exactly the same problem but have no idea how to fix it...
Thank you very much!
Hi @DeckerDai ,
sorry about late. It has been a long time since I did not work on this problem. Previously I tried to run this repo then I moved to other projects. For my case, I only make the cuda version+cudnn version compatible with this repo. I am not quite sure about what other factors will have effects on compilation, for example, gcc. I suggest you can start to follow exactly the same versions used in this repo.
Thank you.
If cuda is not installed in 'usr/local', it does not run correctly. This is due to lines 42 and 46 in ops.py
Note also that lines 37-40 are not used in ops.py, and therefore neither is line 17 in config.ini.
My simple fix to this was to replace lines 37 to 46 in ops.py with the following:
I also removed line 17 from config.ini
Finally, I know this is a custom config file, but I would also suggest perhaps changing line 15 in config.ini to
g++ = g++
I have tested on 3 different machines, all with different versions of linux, tensorflow, and cuda. With these small changes, your code runs immediately following a clone from this repo on all of them (after copying over the config.ini file of course.)
Just a few small suggestions to make things as out-the-box as possible! :)