Open Luedeke opened 5 years ago
I fixed this problem with: sudo apt-get install libopencv-dev. But is there another solution for a virtualenv with pip?
Hi, I'm running sh make.sh and I get an error like this:
ImportError: libcublas.so.9.0: cannot open shared object file: No such file or directory
What is the reason for this? My /usr/local/cuda/lib64 directory contains the file libcublas.so.9.0, so why does it say it cannot find it? I used to use CUDA 9.1, but to compile PoseCNN I have now installed CUDA 9.0, cuDNN 7.0, tensorflow-gpu 1.5.0 and Python 2.7. Can you help me? Thank you.
@yuzw213
I'm using: CUDA 9.0, cuDNN 7.0, tensorflow-gpu r1.8 (built from source!) and, of course, Python 2.7.
Cuda 9.0 (Tensorflow requires 9.0!) https://developer.nvidia.com/cuda-90-download-archive?target_os=Linux&target_arch=x86_64&target_distro=Ubuntu&target_version=1604&target_type=deblocal
Installation Instructions:
sudo dpkg -i cuda-repo-ubuntu1604-9-0-local_9.0.176-1_amd64.deb
sudo apt-key add /var/cuda-repo-<version>/7fa2af80.pub
sudo apt-get update
sudo apt-get install cuda
If you installed it with the runfile or some other method, you should read: http://developer.download.nvidia.com/compute/cuda/9.0/Prod/docs/sidebar/CUDA_Installation_Guide_Linux.pdf
1.1 Using nano ~/.bashrc, add the following line:
export PATH=/usr/local/cuda-9.0/bin${PATH:+:${PATH}}
Then install cuDNN: https://developer.nvidia.com/rdp/cudnn-download
2.3.2. Installing from a Debian File
Navigate to your directory containing cuDNN Debian file.
Install the runtime library, for example:
sudo dpkg -i libcudnn7_7.0.3.11-1+cuda9.0_amd64.deb
Install the developer library, for example:
sudo dpkg -i libcudnn7-dev_7.0.3.11-1+cuda9.0_amd64.deb
Install the code samples and the cuDNN Library User Guide, for example:
sudo dpkg -i libcudnn7-doc_7.0.3.11-1+cuda9.0_amd64.deb
Don't forget to remove the old 9.1 repository: https://help.ubuntu.com/community/Repositories/Ubuntu
Or maybe something from these issues is helpful: https://github.com/tensorflow/tensorflow/issues/17629 https://github.com/tensorflow/tensorflow/issues/15604 -> Maybe you made a mistake in the TensorFlow installation?
I didn't have this bug and I'm also new in this area, so I can't help any further.
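For the libcublas.so.9.0 error, one thing that may be worth checking is whether the directory holding the library is on the dynamic loader's search path, since a file can exist on disk and still not be found at import time. A minimal sketch, assuming the library lives in /usr/local/cuda/lib64 as described in the question; dir_on_path_list is a made-up helper name:

```shell
#!/bin/sh
# Hypothetical helper: succeeds if directory $1 is one of the
# colon-separated entries in the path list $2.
dir_on_path_list() {
    case ":$2:" in
        *":$1:"*) return 0 ;;
        *)        return 1 ;;
    esac
}

# Assumption: the CUDA 9.0 libraries are in /usr/local/cuda/lib64.
cuda_lib=/usr/local/cuda/lib64
if dir_on_path_list "$cuda_lib" "${LD_LIBRARY_PATH:-}"; then
    echo "$cuda_lib is on LD_LIBRARY_PATH"
else
    echo "$cuda_lib is not on LD_LIBRARY_PATH; consider adding:"
    echo "export LD_LIBRARY_PATH=$cuda_lib\${LD_LIBRARY_PATH:+:\${LD_LIBRARY_PATH}}"
fi
```

The loader also consults the ldconfig cache, so a directory missing from LD_LIBRARY_PATH is a hint rather than proof of the cause.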
Thank you for the reply! I will try it at once.
But is there a requirement for the TensorFlow version? Can I compile tensorflow-gpu 1.12.0 from source?
Also, must the gcc/g++ version be 4.8.5? I use 5.4.0.
@yuzw213 Yes, there are requirements. Required environment: Ubuntu 16.04, Tensorflow >= 1.2.0, CUDA >= 8.0 (from here: https://github.com/yuxng/PoseCNN). I would recommend that you use TensorFlow 1.8 with these installation steps: https://github.com/Kaju-Bubanja/PoseCNN.
You can test it with 5.4.0, but it did not work for me: build.sh crashed with it, and so did the kinect_fusion build. With gcc/g++ 4.8.5 it worked for me. If you change back to 5.4.0, just rebuild (cmake) the dependencies again, that's it (maybe you have to build TensorFlow again, I don't know).
Thanks ! I will try again
@Luedeke Hi again. I am new, so I apologize for the silly question. How do I get gcc/g++ 4.8.5 and then switch to it from 5.4? I currently have gcc/g++ 5.4. Do I need to uninstall 5.4 and then install 4.8.5, or is there some other way?
@Abdul-Mukit No problem, I'm also new. These Linux commands were helpful for me:
sudo apt-get install gcc-4.8
sudo apt-get install g++-4.8
The 10 and 30 are priorities for the auto configuration, which you will see after this step.
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-4.8 10
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-5 30
sudo update-alternatives --config gcc
Change to 4.8! So it should be 1.
sudo update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-4.8 10
sudo update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-5 30
sudo update-alternatives --config g++
Change to 4.8! So it should be 1.
gcc --version
g++ --version
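To confirm that the switch took effect for both compilers, something like the following could be run afterwards. It is only a sketch: major_minor is a hypothetical helper, and 4.8 is simply the version that worked in this thread.

```shell
#!/bin/sh
# Hypothetical helper: prints the "major.minor" part of a version string,
# e.g. the output of `gcc -dumpversion`.
major_minor() {
    echo "$1" | cut -d. -f1,2
}

# Compare each compiler against the version that worked in this thread.
wanted=4.8
for tool in gcc g++; do
    if command -v "$tool" >/dev/null 2>&1; then
        found=$(major_minor "$("$tool" -dumpversion)")
        if [ "$found" = "$wanted" ]; then
            echo "$tool: $found (ok)"
        else
            echo "$tool: $found (expected $wanted)"
        fi
    else
        echo "$tool: not installed"
    fi
done
```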
@Luedeke Compiling reports errors like these:
make.sh: 10: make.sh: nvcc: not found
g++: error: hard_label_op.cu.o: No such file or directory
hard_label_layer
make.sh: 21: make.sh: nvcc: not found
g++: error: gradient_reversal_op.cu.o: No such file or directory
gradient_reversal_layer
make.sh: 32: make.sh: nvcc: not found
It tells me that nvcc cannot be found, but I can run the nvcc command in a terminal. Do you know why?
@yuzw213 How did you build TensorFlow? Which CUDA version have you installed? I recommend installing TensorFlow r1.8 from source with CUDA 9.0. I had a similar error because of TensorFlow. Read this installation guide, it's really good: https://github.com/Kaju-Bubanja/PoseCNN/tree/docu_updated
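Independently of the TensorFlow version, the "nvcc: not found" lines suggest that the shell running make.sh cannot resolve nvcc even though the interactive terminal can, for example because nvcc is an alias there or because the script is run through sudo. This is a guess from the error text, not something confirmed in this thread. A small check, with visible_in_sh as a hypothetical helper:

```shell
#!/bin/sh
# Hypothetical helper: succeeds if command $1 can be found by a fresh,
# non-interactive sh, which is how `sh make.sh` runs.
visible_in_sh() {
    sh -c 'command -v "$1" >/dev/null 2>&1' sh "$1"
}

if visible_in_sh nvcc; then
    echo "nvcc is visible to sh"
else
    echo "nvcc is not visible to sh; check that the CUDA bin directory is exported in PATH"
fi
```

If make.sh is run with sudo, the same check should be run with sudo as well, since sudo may replace PATH.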
Thank you! I will switch to version 1.8 and try again.
Installation guide for beginners!
IMPORTANT! More information from Kaju-Bubanja may be found here: https://github.com/Kaju-Bubanja/PoseCNN/tree/docu_updated
I didn't train it; I only used it once. At the moment I'm trying to use the new CNN, also from Xiang, from here: https://github.com/NVlabs/Deep_Object_Pose. I used: CUDA 9.0 with a patch, cuDNN 7.0, tensorflow-gpu 1.8 (from source!), Ubuntu 16.04.
Author of PoseCNN: yuxng commented on May 30: I used ubuntu 16.04, tensorflow 1.5.0, and cuda 9.1 https://github.com/yuxng/PoseCNN/issues/13
sudo apt-get install libopencv-dev
sudo apt-get install virtualenv
virtualenv /home/USER/python-test
source /home/USER/python-test/bin/activate
pip install mock
pip install matplotlib numpy keras Cython Pillow easydict transforms3d
sudo apt-get install libsuitesparse-dev openexr libopenexr-dev metis libmetis-dev
pip install enum34
git clone https://github.com/tensorflow/tensorflow.git
cd tensorflow
git checkout r1.8 # or another release branch: r1.9, r1.10, etc.
# Optional test run, for r1.12 and earlier:
bazel test -c opt -- //tensorflow/... -//tensorflow/compiler/... -//tensorflow/contrib/lite/...
# Optional test run, for versions after r1.12:
bazel test -c opt -- //tensorflow/... -//tensorflow/compiler/... -//tensorflow/lite/...
./configure
bazel build --config=opt --config=cuda //tensorflow/tools/pip_package:build_pip_package
./bazel-bin/tensorflow/tools/pip_package/build_pip_package /tmp/tensorflow_pkg
pip install /tmp/tensorflow_pkg/tensorflow-1.8.0-cp27-cp27mu-linux_x86_64.whl
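The wheel filename in the last step depends on the TensorFlow branch and the Python build, so it may differ from tensorflow-1.8.0-cp27-cp27mu-linux_x86_64.whl. A small sketch that picks up whatever wheel build_pip_package wrote; first_wheel is a hypothetical helper name:

```shell
#!/bin/sh
# Hypothetical helper: prints the path of the first tensorflow wheel in
# directory $1, or fails if there is none.
first_wheel() {
    for f in "$1"/tensorflow-*.whl; do
        if [ -e "$f" ]; then
            echo "$f"
            return 0
        fi
    done
    return 1
}

# /tmp/tensorflow_pkg is the output directory used in the steps above.
if wheel=$(first_wheel /tmp/tensorflow_pkg); then
    echo "would run: pip install $wheel"
else
    echo "no tensorflow wheel found in /tmp/tensorflow_pkg"
fi
```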
sudo sh -c 'echo "deb http://packages.ros.org/ros/ubuntu $(lsb_release -sc) main" > /etc/apt/sources.list.d/ros-latest.list'
sudo apt-key adv --keyserver hkp://ha.pool.sks-keyservers.net:80 --recv-key 421C365BD9FF1F717815A3895523BAEEB01FA116
sudo apt-get update
sudo apt-get install ros-kinetic-desktop-full
apt-cache search ros-kinetic
sudo rosdep init
rosdep update
echo "source /opt/ros/kinetic/setup.bash" >> ~/.bashrc
source ~/.bashrc
sudo apt install python-rosinstall python-rosinstall-generator python-wstool build-essential
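Running the echo ... >> ~/.bashrc step more than once appends the same line repeatedly. A sketch of an idempotent variant; append_once is a hypothetical helper, and the demo writes to a temporary file rather than the real ~/.bashrc:

```shell
#!/bin/sh
# Hypothetical helper: appends line $1 to file $2 unless an identical
# line is already present.
append_once() {
    touch "$2"
    grep -qxF -e "$1" "$2" || printf '%s\n' "$1" >> "$2"
}

# Demo on a scratch file; use ~/.bashrc instead of "$demo" for real use.
demo=$(mktemp)
append_once 'source /opt/ros/kinetic/setup.bash' "$demo"
append_once 'source /opt/ros/kinetic/setup.bash' "$demo"
wc -l < "$demo"   # the line was written only once
rm -f "$demo"
```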
sudo apt-get install gcc-4.8
sudo apt-get install g++-4.8
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-4.8 10
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-5 30
sudo update-alternatives --config gcc
sudo update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-4.8 10
sudo update-alternatives --install /usr/bin/g++ g++ /usr/bin/g++-5 30
sudo update-alternatives --config g++
gcc --version
g++ --version
wget https://bitbucket.org/eigen/eigen/get/3.3.0.zip
unzip 3.3.0.zip
mv <extracted-directory> eigen
rm 3.3.0.zip
cd eigen
mkdir build && cd build
cmake ..
make
sudo make install
wget https://github.com/jlblancoc/nanoflann/archive/ad7547f4e6beb1cdb3e360912fd2e352ef959465.zip
unzip ad7547f4e6beb1cdb3e360912fd2e352ef959465.zip
mv nanoflann-ad7547f4e6beb1cdb3e360912fd2e352ef959465 nanoflann
rm ad7547f4e6beb1cdb3e360912fd2e352ef959465.zip
sudo apt-get install build-essential cmake libgtest-dev libeigen3-dev # maybe not necessary if you already installed Eigen from source?
cd nanoflann
mkdir build && cd build && cmake ..
make && make test
sudo make install
wget https://github.com/stevenlovegrove/Pangolin/archive/1ec721d59ff6b799b9c24b8817f3b7ad2c929b83.zip
unzip 1ec721d59ff6b799b9c24b8817f3b7ad2c929b83.zip
mv Pangolin-1ec721d59ff6b799b9c24b8817f3b7ad2c929b83 Pangolin
rm 1ec721d59ff6b799b9c24b8817f3b7ad2c929b83.zip
sudo apt-get install libglew-dev
sudo apt-get install cmake
sudo apt-get install ffmpeg libavcodec-dev libavutil-dev libavformat-dev libswscale-dev libavdevice-dev
sudo apt-get install libdc1394-22-dev libraw1394-dev
sudo apt-get install libjpeg-dev libpng12-dev libtiff5-dev libopenexr-dev
sudo apt-get install doxygen doxygen-doc doxygen-gui graphviz
cd Pangolin
mkdir build
cd build
cmake ..
cmake --build .
cmake --build . --target doc
wget https://dl.bintray.com/boostorg/release/1.67.0/source/boost_1_67_0.tar.bz2
tar -xjf boost_1_67_0.tar.bz2
rm boost_1_67_0.tar.bz2
wget https://github.com/strasdat/Sophus/archive/ceb6380a1584b300e687feeeea8799353d48859f.zip
unzip ceb6380a1584b300e687feeeea8799353d48859f.zip
mv Sophus-ceb6380a1584b300e687feeeea8799353d48859f Sophus
rm ceb6380a1584b300e687feeeea8799353d48859f.zip
cd Sophus
mkdir build && cd build
cmake ..
make
sudo make install
wget https://github.com/stevengj/nlopt/archive/74e647b667f7c4500cdb4f37653e59c29deb9ee2.zip
unzip 74e647b667f7c4500cdb4f37653e59c29deb9ee2.zip
rm 74e647b667f7c4500cdb4f37653e59c29deb9ee2.zip
mv nlopt-74e647b667f7c4500cdb4f37653e59c29deb9ee2 nlopt
cd nlopt
mkdir build
cd build
cmake ..
make
sudo make install
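The nanoflann, Pangolin, Sophus and nlopt steps all follow the same download, unpack and rename pattern for a pinned commit. A sketch that factors this out; both function names are hypothetical, and the URL layout is simply the one used in the wget lines of this guide:

```shell
#!/bin/sh
# Hypothetical helpers for the GitHub commit archives used in this guide.
# archive_url OWNER REPO COMMIT -> URL of the zip for that commit
archive_url() {
    echo "https://github.com/$1/$2/archive/$3.zip"
}

# fetch_commit OWNER REPO COMMIT: download, unpack, rename to REPO.
# Needs wget and unzip; it is not called here, so nothing is downloaded.
fetch_commit() {
    wget "$(archive_url "$1" "$2" "$3")" &&
    unzip "$3.zip" &&
    mv "$2-$3" "$2" &&
    rm "$3.zip"
}

# Print the URL for the Sophus commit used in this guide.
archive_url strasdat Sophus ceb6380a1584b300e687feeeea8799353d48859f
```

Calling fetch_commit strasdat Sophus ceb6380a1584b300e687feeeea8799353d48859f would then leave a Sophus directory ready for the cmake steps.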
git clone https://github.com/Kaju-Bubanja/PoseCNN.git
cd PoseCNN/lib
cd kinect_fusion
mkdir build
cd build
cmake ..
make
cd ../..
cd synthesize
mkdir build
cd build
cmake ..
make
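Both builds use the same out-of-source CMake pattern. A sketch that wraps it in a subshell so the working directory is unchanged afterwards, which avoids having to track the cd steps by hand; build_out_of_source is a hypothetical name:

```shell
#!/bin/sh
# Hypothetical helper: configure and build the CMake project in directory $1
# inside $1/build. The parentheses run it in a subshell, so the caller's
# working directory is not changed.
build_out_of_source() {
    (
        mkdir -p "$1/build" &&
        cd "$1/build" &&
        cmake .. &&
        make
    )
}

# Intended use from PoseCNN/lib (commented out so this file is safe to run):
# build_out_of_source kinect_fusion
# build_out_of_source synthesize
```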
./experiments/scripts/demo.sh #It will automatically use gpuid 0!
./experiments/scripts/demo.sh --gpuid 0 #Spaces are important!
cd $ROOT/data/LOV
ln -s $ycb_data data
ln -s $ycb_models models
cd $ROOT
./experiments/scripts/lov_color_2d_train.sh $GPU_ID
./experiments/scripts/lov_color_2d_test.sh $GPU_ID
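$ROOT, $ycb_data and $ycb_models are not set anywhere in these steps, so they presumably have to be defined first. The assumption here is that $ROOT is the PoseCNN checkout and the other two are the directories holding the downloaded data and models. A sketch that creates the links and fails early if a target is missing; link_dataset is a hypothetical helper:

```shell
#!/bin/sh
# Hypothetical helper: link_dataset ROOT DATA_DIR MODELS_DIR
# creates ROOT/data/LOV/data and ROOT/data/LOV/models as symlinks.
link_dataset() {
    [ -d "$2" ] || { echo "missing data dir: $2" >&2; return 1; }
    [ -d "$3" ] || { echo "missing models dir: $3" >&2; return 1; }
    mkdir -p "$1/data/LOV" &&
    ln -sfn "$2" "$1/data/LOV/data" &&
    ln -sfn "$3" "$1/data/LOV/models"
}

# Intended use (the variables are placeholders, as in the steps above):
# link_dataset "$ROOT" "$ycb_data" "$ycb_models"
```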
The most correct one
Hello,
If I run sh make.sh, I get the following error:
but: build computing label layer
So is this the normal behavior, or did I miss something? I don't think so, but how can I resolve that?
I installed OpenCV with pip:
After that, running ./experiments/scripts/demo.sh $GPU_ID gives this error:
/gpu:0
Traceback (most recent call last):
  File "./tools/demo.py", line 118, in <module>
    from networks.factory import get_network
  File "/home/nils/test/PoseCNN/tools/../lib/networks/__init__.py", line 8, in <module>
    from .vgg16 import vgg16
  File "/home/nils/test/PoseCNN/tools/../lib/networks/vgg16.py", line 2, in <module>
    from networks.network import Network
  File "/home/nils/test/PoseCNN/tools/../lib/networks/network.py", line 17, in <module>
    import hough_voting_layer.hough_voting_op as hough_voting_op
  File "/home/nils/test/PoseCNN/tools/../lib/hough_voting_layer/hough_voting_op.py", line 5, in <module>
    _hough_voting_module = tf.load_op_library(filename)
  File "/home/nils/test/local/lib/python2.7/site-packages/tensorflow/python/framework/load_library.py", line 56, in load_op_library
    lib_handle = py_tf.TF_LoadLibrary(library_filename)
tensorflow.python.framework.errors_impl.NotFoundError: /home/nils/test/PoseCNN/tools/../lib/hough_voting_layer/hough_voting.so: cannot open shared object file: No such file or directory
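The traceback says hough_voting.so does not exist, which would be consistent with make.sh having failed to build that op. A quick way to see which op libraries were actually produced; list_built_ops is a hypothetical helper, and the lib path is the one from the traceback:

```shell
#!/bin/sh
# Hypothetical helper: prints every .so file below directory $1, sorted.
list_built_ops() {
    find "$1" -type f -name '*.so' | sort
}

# Path as it appears in the traceback; adjust to your checkout.
lib_dir=/home/nils/test/PoseCNN/lib
if [ -d "$lib_dir" ]; then
    list_built_ops "$lib_dir"
else
    echo "$lib_dir does not exist on this machine"
fi
```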
kinect_fusion builds successfully, but if I build synthesize:
(test) nils@Nils-Desktop-U:~/test/PoseCNN/lib/synthesize/build$ cmake ..
-- OpenCV ARCH:
-- OpenCV RUNTIME:
-- OpenCV STATIC: ON
CMake Warning at /home/nils/test/lib/python2.7/site-packages/cv2/OpenCVConfig.cmake:166 (message):
  Found OpenCV Windows Pack but it has no binaries compatible with your configuration.
  You should manually point CMake variable OpenCV_DIR to your build of OpenCV library.
Call Stack (most recent call first):
  CMakeLists.txt:28 (find_package)
CMake Error at CMakeLists.txt:28 (find_package):
  Found package configuration file:
  but it set OpenCV_FOUND to FALSE so package "OpenCV" is considered to be NOT FOUND.
-- Configuring incomplete, errors occurred!
See also "/home/nils/test/PoseCNN/lib/synthesize/build/CMakeFiles/CMakeOutput.log".
See also "/home/nils/test/PoseCNN/lib/synthesize/build/CMakeFiles/CMakeError.log".
Thx for your help.
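The CMake warning comes from the OpenCVConfig.cmake inside the virtualenv's cv2 package, which find_package picks up instead of a system OpenCV. As the warning itself suggests, one option is to point OpenCV_DIR at a different configuration. A sketch that lists candidate directories; find_opencv_configs is a hypothetical helper, and the assumption is that libopencv-dev installs its config somewhere under /usr:

```shell
#!/bin/sh
# Hypothetical helper: prints the directories below $1 that contain an
# OpenCVConfig.cmake file.
find_opencv_configs() {
    find "$1" -type f -name OpenCVConfig.cmake 2>/dev/null |
    while IFS= read -r cfg; do
        dirname "$cfg"
    done | sort
}

# Assumption: the system package puts its config under /usr.
find_opencv_configs /usr

# Then, in lib/synthesize/build, pass one of the printed directories:
# cmake -DOpenCV_DIR=<printed-directory> ..
```

If nothing is printed, no system OpenCV development package is installed, and the pip-installed cv2 alone will probably not satisfy the synthesize build.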