torch / cutorch

A CUDA backend for Torch7
Other
337 stars 208 forks source link

no default constructor error on install #682

Closed Nyrt closed 7 years ago

Nyrt commented 7 years ago

I have been trying to update cutorch, since I've been having issues with torch that appear to have been fixed in the lastest version, but I am unable to install. I've been using "luarocks install cutorch" to install, which has been failing with the following results: (sorry about the huge code block, I'm not sure what's useful.)

nathan@nathan-ubuntu:~$ luarocks install cutorch
Installing https://raw.githubusercontent.com/torch/rocks/master/cutorch-scm-1.rockspec...
Using https://raw.githubusercontent.com/torch/rocks/master/cutorch-scm-1.rockspec... switching to 'build' mode
Cloning into 'cutorch'...
remote: Counting objects: 220, done.
remote: Compressing objects: 100% (174/174), done.
remote: Total 220 (delta 60), reused 93 (delta 44), pack-reused 0
Receiving objects: 100% (220/220), 211.56 KiB | 0 bytes/s, done.
Resolving deltas: 100% (60/60), done.
Checking connectivity... done.
cmake -E make_directory build && cd build && cmake .. -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/home/nathan/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/home/nathan/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$(getconf _NPROCESSORS_ONLN) install

-- The C compiler identification is GNU 4.8.4
-- The CXX compiler identification is GNU 4.8.4
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Found Torch7 in /home/nathan/torch/install
-- Found CUDA: /usr/local/cuda-8.0 (found suitable version "8.0", minimum required is "6.5") 
-- TH_LIBRARIES: TH
-- MAGMA not found. Compiling without MAGMA support
-- Automatic GPU detection failed. Building for common architectures.
-- Autodetected CUDA architecture(s): 3.0;3.5;5.0;5.2;6.0;6.1;6.1+PTX
-- got cuda version 8.0
-- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor
-- CUDA_NVCC_FLAGS: -gencode;arch=compute_30,code=sm_30;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_52,code=sm_52;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_61,code=compute_61;-DCUDA_HAS_FP16=1
-- THC_SO_VERSION: 0
-- Configuring done
-- Generating done
-- Build files have been written to: /tmp/luarocks_cutorch-scm-1-510/cutorch/build
[  2%] [  2%] [  3%] [  4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCHalf.cu.o
Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCReduceApplyUtils.cu.o
Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCBlas.cu.o
Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCSleep.cu.o
[  5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCStorage.cu.o
[  6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCStorageCopy.cu.o
[  8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensor.cu.o
[  9%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorCopy.cu.o
[ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath.cu.o
[ 11%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath2.cu.o
/usr/local/cuda-8.0/include/thrust/detail/reference.inl(127): error: no default constructor exists for class "thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>"
          detected during:
            instantiation of "void thrust::reference<Element, Pointer, Derived>::assign_from(OtherPointer) [with Element=signed long, Pointer=thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>, Derived=thrust::use_default, OtherPointer=const signed long *]" 
(65): here
            instantiation of "thrust::reference<Element, Pointer, Derived>::derived_type &thrust::reference<Element, Pointer, Derived>::operator=(const thrust::reference<Element, Pointer, Derived>::value_type &) [with Element=signed long, Pointer=thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>, Derived=thrust::use_default]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/reduce_intervals.inl(150): here
            instantiation of "void thrust::system::cuda::detail::commutative_reduce_intervals_closure<InputIterator, OutputIterator, BinaryFunction, Decomposition, Context>::operator()() [with InputIterator=thrust::transform_iterator<thrust::detail::predicate_to_integral<NonZeroOp<unsigned char>, signed long>, thrust::device_ptr<unsigned char>, signed long, thrust::use_default>, OutputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, BinaryFunction=thrust::plus<signed long>, Decomposition=thrust::system::detail::internal::uniform_decomposition<signed long>, Context=thrust::system::cuda::detail::detail::blocked_thread_array]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/detail/launch_closure.inl(52): here
            instantiation of "void thrust::system::cuda::detail::detail::launch_closure_by_value(Closure) [with Closure=thrust::system::cuda::detail::commutative_reduce_intervals_closure<thrust::transform_iterator<thrust::detail::predicate_to_integral<NonZeroOp<unsigned char>, signed long>, thrust::device_ptr<unsigned char>, signed long, thrust::use_default>, thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, thrust::plus<signed long>, thrust::system::detail::internal::uniform_decomposition<signed long>, thrust::system::cuda::detail::detail::blocked_thread_array>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/detail/launch_closure.inl(81): here
            instantiation of "thrust::system::cuda::detail::detail::closure_launcher_base<Closure, launch_by_value>::launch_function_t thrust::system::cuda::detail::detail::closure_launcher_base<Closure, launch_by_value>::get_launch_function() [with Closure=thrust::system::cuda::detail::commutative_reduce_intervals_closure<thrust::transform_iterator<thrust::detail::predicate_to_integral<NonZeroOp<unsigned char>, signed long>, thrust::device_ptr<unsigned char>, signed long, thrust::use_default>, thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, thrust::plus<signed long>, thrust::system::detail::internal::uniform_decomposition<signed long>, thrust::system::cuda::detail::detail::blocked_thread_array>, launch_by_value=true]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/detail/launch_closure.inl(206): here
            [ 2 instantiation contexts not shown ]
            instantiation of "thrust::system::cuda::detail::detail::launch_calculator<Closure>::launch_calculator() [with Closure=thrust::system::cuda::detail::commutative_reduce_intervals_closure<thrust::transform_iterator<thrust::detail::predicate_to_integral<NonZeroOp<unsigned char>, signed long>, thrust::device_ptr<unsigned char>, signed long, thrust::use_default>, thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, thrust::plus<signed long>, thrust::system::detail::internal::uniform_decomposition<signed long>, thrust::system::cuda::detail::detail::blocked_thread_array>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/reduce_intervals.inl(189): here
            instantiation of "void thrust::system::cuda::detail::reduce_intervals(thrust::system::cuda::detail::execution_policy<DerivedPolicy> &, InputIterator, OutputIterator, BinaryFunction, Decomposition) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator=thrust::transform_iterator<thrust::detail::predicate_to_integral<NonZeroOp<unsigned char>, signed long>, thrust::device_ptr<unsigned char>, signed long, thrust::use_default>, OutputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, BinaryFunction=thrust::plus<signed long>, Decomposition=thrust::system::detail::internal::uniform_decomposition<signed long>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/copy_if.inl(201): here
            instantiation of "OutputIterator thrust::system::cuda::detail::copy_if_detail::copy_if(thrust::system::cuda::detail::execution_policy<DerivedPolicy> &, InputIterator1, InputIterator1, InputIterator2, OutputIterator, Predicate) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator1=thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, InputIterator2=thrust::device_ptr<unsigned char>, OutputIterator=thrust::permutation_iterator<thrust::device_ptr<long>, thrust::transform_iterator<strided_range<thrust::device_ptr<long>>::stride_functor, thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, thrust::use_default, thrust::use_default>>, Predicate=NonZeroOp<unsigned char>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/copy_if.inl(251): here
            instantiation of "OutputIterator thrust::system::cuda::detail::copy_if(thrust::system::cuda::detail::execution_policy<DerivedPolicy> &, InputIterator1, InputIterator1, InputIterator2, OutputIterator, Predicate) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator1=thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, InputIterator2=thrust::device_ptr<unsigned char>, OutputIterator=thrust::permutation_iterator<thrust::device_ptr<long>, thrust::transform_iterator<strided_range<thrust::device_ptr<long>>::stride_functor, thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, thrust::use_default, thrust::use_default>>, Predicate=NonZeroOp<unsigned char>]" 
/usr/local/cuda-8.0/include/thrust/detail/copy_if.inl(60): here
            instantiation of "OutputIterator thrust::copy_if(const thrust::detail::execution_policy_base<DerivedPolicy> &, InputIterator1, InputIterator1, InputIterator2, OutputIterator, Predicate) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator1=thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, InputIterator2=thrust::device_ptr<unsigned char>, OutputIterator=thrust::permutation_iterator<thrust::device_ptr<long>, thrust::transform_iterator<strided_range<thrust::device_ptr<long>>::stride_functor, thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, thrust::use_default, thrust::use_default>>, Predicate=NonZeroOp<unsigned char>]" 
/tmp/luarocks_cutorch-scm-1-510/cutorch/lib/THC/generic/THCTensorMath.cu(313): here

/usr/local/cuda-8.0/include/thrust/detail/reference.inl(98): error: no default constructor exists for class "thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>"
          detected during:
            instantiation of "thrust::reference<Element, Pointer, Derived>::operator thrust::reference<Element, Pointer, Derived>::value_type() const [with Element=signed long, Pointer=thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>, Derived=thrust::use_default]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/bulk/algorithm/scan.hpp(514): here
            instantiation of "RandomAccessIterator2 thrust::system::cuda::detail::bulk_::inclusive_scan(thrust::system::cuda::detail::bulk_::concurrent_group<thrust::system::cuda::detail::bulk_::agent<grainsize>, size> &, RandomAccessIterator1, RandomAccessIterator1, RandomAccessIterator2, BinaryFunction) [with size=512UL, grainsize=3UL, RandomAccessIterator1=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, RandomAccessIterator2=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, BinaryFunction=thrust::plus<signed long>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/scan.inl(55): here
            instantiation of "void thrust::system::cuda::detail::scan_detail::inclusive_scan_n::operator()(ConcurrentGroup &, InputIterator, Size, OutputIterator, BinaryFunction) [with ConcurrentGroup=thrust::system::cuda::detail::bulk_::concurrent_group<thrust::system::cuda::detail::bulk_::agent<3UL>, 512UL>, InputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, Size=signed long, OutputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, BinaryFunction=thrust::plus<signed long>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/bulk/detail/apply_from_tuple.hpp(83): here
            instantiation of "void thrust::system::cuda::detail::bulk_::detail::apply_from_tuple(Function, const thrust::tuple<Arg1, Arg2, Arg3, Arg4, Arg5, thrust::null_type, thrust::null_type, thrust::null_type, thrust::null_type, thrust::null_type> &) [with Function=thrust::system::cuda::detail::scan_detail::inclusive_scan_n, Arg1=thrust::system::cuda::detail::bulk_::concurrent_group<thrust::system::cuda::detail::bulk_::agent<3UL>, 512UL> &, Arg2=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, Arg3=signed long, Arg4=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, Arg5=thrust::plus<signed long>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/bulk/detail/closure.hpp(50): here
            instantiation of "void thrust::system::cuda::detail::bulk_::detail::closure<Function, Tuple>::operator()() [with Function=thrust::system::cuda::detail::scan_detail::inclusive_scan_n, Tuple=thrust::tuple<thrust::system::cuda::detail::bulk_::concurrent_group<thrust::system::cuda::detail::bulk_::agent<3UL>, 512UL> &, thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, signed long, thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, thrust::plus<signed long>, thrust::null_type, thrust::null_type, thrust::null_type, thrust::null_type, thrust::null_type>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/bulk/detail/cuda_task.hpp(58): here
            [ 11 instantiation contexts not shown ]
            instantiation of "OutputIterator thrust::system::cuda::detail::inclusive_scan(thrust::system::cuda::detail::execution_policy<DerivedPolicy> &, InputIterator, InputIterator, OutputIterator, AssociativeOperator) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, OutputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, AssociativeOperator=thrust::plus<signed long>]" 
/usr/local/cuda-8.0/include/thrust/detail/scan.inl(63): here
            instantiation of "OutputIterator thrust::inclusive_scan(const thrust::detail::execution_policy_base<DerivedPolicy> &, InputIterator, InputIterator, OutputIterator, AssociativeOperator) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, OutputIterator=thrust::detail::normal_iterator<thrust::pointer<signed long, thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, thrust::use_default, thrust::use_default>>, AssociativeOperator=thrust::plus<signed long>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/copy_if.inl(204): here
            instantiation of "OutputIterator thrust::system::cuda::detail::copy_if_detail::copy_if(thrust::system::cuda::detail::execution_policy<DerivedPolicy> &, InputIterator1, InputIterator1, InputIterator2, OutputIterator, Predicate) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator1=thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, InputIterator2=thrust::device_ptr<unsigned char>, OutputIterator=thrust::permutation_iterator<thrust::device_ptr<long>, thrust::transform_iterator<strided_range<thrust::device_ptr<long>>::stride_functor, thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, thrust::use_default, thrust::use_default>>, Predicate=NonZeroOp<unsigned char>]" 
/usr/local/cuda-8.0/include/thrust/system/cuda/detail/copy_if.inl(251): here
            instantiation of "OutputIterator thrust::system::cuda::detail::copy_if(thrust::system::cuda::detail::execution_policy<DerivedPolicy> &, InputIterator1, InputIterator1, InputIterator2, OutputIterator, Predicate) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator1=thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, InputIterator2=thrust::device_ptr<unsigned char>, OutputIterator=thrust::permutation_iterator<thrust::device_ptr<long>, thrust::transform_iterator<strided_range<thrust::device_ptr<long>>::stride_functor, thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, thrust::use_default, thrust::use_default>>, Predicate=NonZeroOp<unsigned char>]" 
/usr/local/cuda-8.0/include/thrust/detail/copy_if.inl(60): here
            instantiation of "OutputIterator thrust::copy_if(const thrust::detail::execution_policy_base<DerivedPolicy> &, InputIterator1, InputIterator1, InputIterator2, OutputIterator, Predicate) [with DerivedPolicy=thrust::detail::execute_with_allocator<THCThrustAllocator, thrust::system::cuda::detail::execute_on_stream_base>, InputIterator1=thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, InputIterator2=thrust::device_ptr<unsigned char>, OutputIterator=thrust::permutation_iterator<thrust::device_ptr<long>, thrust::transform_iterator<strided_range<thrust::device_ptr<long>>::stride_functor, thrust::counting_iterator<long, thrust::use_default, thrust::use_default, thrust::use_default>, thrust::use_default, thrust::use_default>>, Predicate=NonZeroOp<unsigned char>]" 
/tmp/luarocks_cutorch-scm-1-510/cutorch/lib/THC/generic/THCTensorMath.cu(313): here

2 errors detected in the compilation of "/tmp/tmpxft_00000f92_00000000-17_THCTensorMath.compute_61.cpp1.ii".
CMake Error at THC_generated_THCTensorMath.cu.o.cmake:267 (message):
  Error generating file
  /tmp/luarocks_cutorch-scm-1-510/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/./THC_generated_THCTensorMath.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
make[1]: *** [lib/THC/CMakeFiles/THC.dir/all] Error 2
make: *** [all] Error 2

Error: Build error: Failed building.

I'm running Ubuntu 14, and CUDA 8.0.

lillekemiker commented 7 years ago

I think this is similar to the error I am experiencing, although the error happens while compiling THC_generated_THCTensorSortInt.cu.o instead. I was attempting a fresh install using torch/distro with the install.sh script on Ubuntu 14.04.5 with CUDA 8.0. However, I also tried installing with the current master on both torch7 (3caabb9) and cutorch (16fb59f) to no avail.

This is all the output generated from the install.sh script: http://pastebin.com/zXVrzWkG

akinokaze commented 7 years ago

Experiencing a similar issue as well.

CentOS 7.2, CUDA 8, gcc 4.8

Output of luarocks install cutorch here:

http://pastebin.com/ukmf0LdR

pavanky commented 7 years ago

Possibly related to this issue: https://github.com/torch/torch7/issues/918

@akinokaze @lillekemiker @Nyrt Are you using the beta release of CUDA 8.0 or the final release?

akinokaze commented 7 years ago

Release here.

/usr/local/cuda-8.0/bin/nvcc --version

nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2016 NVIDIA Corporation Built on Wed_May__4_21:01:56_CDT_2016 Cuda compilation tools, release 8.0, V8.0.26

pavanky commented 7 years ago

@akinokaze That's pre-release. Release is v8.0.44

akinokaze commented 7 years ago

Today I learned! I'll try updating and see if that fixes my issue.

akinokaze commented 7 years ago

Looks like it works. Going to 8.0.44 and cutorch installed successfully. Thanks for the help.

Nyrt commented 7 years ago

Looks like I'm also using 8.0.26. Let's see if that works consistently.

Nyrt commented 7 years ago

Yup! That appears to have done it! Thanks @pavanky, I've been banging my head against this for a while.

lillekemiker commented 7 years ago

I can confirm that I am also running CUDA 8.0.26. Thanks for your help @pavanky

pokaxpoka commented 7 years ago

I have a similar problem even though I'm using 8.0.44..

I attach my error message. Do I get some advice ?

out.txt

Thanks.

pavanky commented 7 years ago

The includes seem to be pointing to the older version. Can you delete CUDA (/usr/local/cuda) and reinstall it ?

pokaxpoka commented 7 years ago

Thanks for your adivce ! I delete CUDA (/usr/local/cuda) and reinstall it as follows:

sudo apt-get autoremove cuda sudo dpkg -i cuda-repo-ubuntu1404-8-0-local_8.0.44-1_amd64-deb sudo apt-get update sudo apt-get install cuda

But, I still have the same problem.

Output here: output2.txt


I solve this problem by deleting both (/usr/local/cuda-8.0) and (/usr/local/cuda) and reinstalling cuda !

Thanks @pavanky !

Botong commented 7 years ago

Hi, I am also using cuda 8.0.26 however, I don't have root access to update cuda, do you have a way around this issue? Thanks!

pavanky commented 7 years ago

@Botong you can always download the cuda toolkit zip file and unpack it in your home directory. You don't need root privileges for this.

You'll have to export CUDA_BIN_PATH=/path/to/new/cuda-8.0 before installing the packages for them to pick up the right cuda packages.

pavanky commented 7 years ago

Direct link to the run file: https://developer.nvidia.com/compute/cuda/8.0/prod/local_installers/cuda_8.0.44_linux-run

popperXP commented 7 years ago

very helpful