torch / cutorch

A CUDA backend for Torch7
Other
337 stars 208 forks source link

Fails to build against CUDA 10 #834

Open eigengrau opened 6 years ago

eigengrau commented 6 years ago

When building HEAD against CUDA 10.0.130, the build fails with:

/home/seb/tmp/build/torch7-cutorch-git/src/torch7-cutorch-git/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/run/user/1000/tmpxft_00007438_00000000-4_THCTensorIndex.cpp4.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
  Error generating file
  /home/seb/tmp/build/torch7-cutorch-git/src/torch7-cutorch-git/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o
granzuglia commented 6 years ago

I have exactly the same issue.

nagadomi commented 6 years ago

EDIT: I have merged these changes to my distro repo. https://github.com/nagadomi/distro

Building Torch with CUDA10 has two problems.

  1. cmake/3.6/Modules/FindCUDA.cmake is outdated.
  2. atomicAdd(__half *address, __half val) is defined in /usr/local/cuda/include/cuda_fp16.h (duplicated).

How to build Torch with CUDA10

  1. Install the latest CMake from github repo (the latest FindCUDA.cmake will be installed)
$ sudo apt-get purge cmake
$ git clone https://github.com/Kitware/CMake.git
$ cd CMake
$ ./bootstrap; make; sudo make install
  1. Remove FindCUDA.cmake.
$ cd ~/torch
$ rm -fr cmake/3.6/Modules/FindCUDA*
  1. Apply the following patch to cutorch
diff --git a/lib/THC/THCAtomics.cuh b/lib/THC/THCAtomics.cuh
index 400875c..ccb7a1c 100644
--- a/lib/THC/THCAtomics.cuh
+++ b/lib/THC/THCAtomics.cuh
@@ -94,6 +94,7 @@ static inline __device__ void atomicAdd(long *address, long val) {
 }

 #ifdef CUDA_HALF_TENSOR
+#if !(__CUDA_ARCH__ >= 700 || !defined(__CUDA_ARCH__) )
 static inline  __device__ void atomicAdd(half *address, half val) {
   unsigned int * address_as_ui =
       (unsigned int *) ((char *)address - ((size_t)address & 2));
@@ -117,6 +118,7 @@ static inline  __device__ void atomicAdd(half *address, half val) {
    } while (assumed != old);
 }
 #endif
+#endif
$ cd extra/cutorch
$ cat > atomic.patch
<copy and paste the patch>
$ patch -p1 < atomic.patch
  1. Build
    $ ./clean.sh
    $ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
    $ ./install.sh

I was successfully build Torch with CUDA10 with the above steps.

WhiskyChoy commented 6 years ago

@nagadomi Yeah, it really works! Thanks a lot. I learnt elsewhere that a newer version of cmake is needed so I installed one. However I didn't remove the FindCUDA.cmake the torch package contained. Now it works well, after I finished the steps you recommended.

Archit01 commented 6 years ago

Thanks

HZRun commented 6 years ago

@nagadomi It works following your steps. Thank you

PhilipDro commented 6 years ago

@nagadomi Thanks a lot that helped with my issue. It seems though, that removing FindCUDA will cause problems when trying to install cuDDN.

nagadomi commented 6 years ago

@PhilipDro The master branch of cuDNN.torch does not support cuDNN v7. Installing from R7 branch probably works fine.

$ git clone https://github.com/soumith/cudnn.torch.git -b R7
$ cd cudnn.torch
$ luarocks make cudnn-scm-1.rockspec
PhilipDro commented 6 years ago

@nagadomi Thank you for your response. Your suggested workaround does not work unfortunately. I get following error:

`CMake Error at /home/philip/torch/install/share/cmake/torch/FindCUDA.cmake:617 (message): Specify CUDA_TOOLKIT_ROOT_DIR Call Stack (most recent call first): CMakeLists.txt:14 (FIND_PACKAGE)

-- Configuring incomplete, errors occurred! See also "/home/philip/cudnn.torch/build/CMakeFiles/CMakeOutput.log".

Error: Build error: Failed building. `

Anupama94 commented 6 years ago

Try changing the gcc version to gcc-6. Worked for me!

Cdfghglz commented 6 years ago

I also could not make it work until I did not kick out these lines:

diff --git a/install.sh b/install.sh
index ad65434..65a0163 100755
--- a/install.sh
+++ b/install.sh
@@ -99,9 +99,6 @@ fi
 if [ -x "$path_to_nvcc" ] || [ -x "$path_to_nvidiasmi" ]
 then
     echo "Found CUDA on your machine. Installing CMake 3.6 modules to get up-to-date FindCUDA"
-    cd ${THIS_DIR}/cmake/3.6 && \
-(cmake -E make_directory build && cd build && cmake .. -DCMAKE_INSTALL_PREFIX="${PREFIX}" \
-        && make install) && echo "FindCuda bits of CMake 3.6 installed" || exit 1
 fi

After this it just used the newest cmake (3.13 for me) and did not complain any more.

JohnHammell commented 5 years ago

Thank you @nagadomi ! Your instructions above were very helpful in getting Torch7 installed and running with CUDA 10 & cuDNN 7.3 on Ubuntu 18.04 LTS

rahilwazir commented 5 years ago

@nagadomi I'm trying to install it via luarocks install cutorch but your workaround didn't work for me. Please help me with this

...
...
[ 16%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathScan.cu.o
[ 17%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o
/tmp/luarocks_cutorch-scm-1-6307/cutorch/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/jet/tmp/tmpxft_000050de_00000000-12_THCTensorIndex.compute_70.cpp1.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
  Error generating file
  /tmp/luarocks_cutorch-scm-1-9488/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:161: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....

Using the newer CMake with the applied patch

cmake --version
cmake version 3.13.20181111-g206ce3

CMake suite maintained and supported by Kitware (kitware.com/cmake).
kyang-06 commented 5 years ago

@nagadomi I'm trying to install it via luarocks install cutorch but your workaround didn't work for me. Please help me with this

...
...
[ 16%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathScan.cu.o
[ 17%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o
/tmp/luarocks_cutorch-scm-1-6307/cutorch/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/jet/tmp/tmpxft_000050de_00000000-12_THCTensorIndex.compute_70.cpp1.ii".
CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message):
  Error generating file
  /tmp/luarocks_cutorch-scm-1-9488/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:161: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o' failed
make[2]: *** [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....

Using the newer CMake with the applied patch

cmake --version
cmake version 3.13.20181111-g206ce3

CMake suite maintained and supported by Kitware (kitware.com/cmake).

have the same issue with you

nagadomi commented 5 years ago

I'm trying to install it via luarocks install cutorch

luarocks install cutorch command downloads cutorch from git repo and installs it, so the patch will not be applied. To install modified cutorch from local disk, use the following command in cutorch dir.

$ luarocks make rocks/cutorch-scm-1.rockspec
kyang-06 commented 5 years ago

I'm trying to install it via luarocks install cutorch

luarocks install cutorch command downloads cutorch from git repo and installs it, so the patch will not be applied. To install modified cutorch from local disk, use the following command in cutorch dir.

$ luarocks make rocks/cutorch-scm-1.rockspec

Great! It works. Thank you so much

doneforaiur commented 5 years ago

I'm trying to install it via luarocks install cutorch

luarocks install cutorch command downloads cutorch from git repo and installs it, so the patch will not be applied. To install modified cutorch from local disk, use the following command in cutorch dir.

$ luarocks make rocks/cutorch-scm-1.rockspec

/home/lelu/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(393): error: more than one operator "==" matches these operands:
function "operator==(const __half &, const __half &)"
function "operator==(half, half)"
operand types are: half == half

/home/lelu/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(414): error: more than one operator "==" matches these operands: function "operator==(const half &, const half &)" function "operator==(half, half)" operand types are: half == half

2 errors detected in the compilation of "/tmp/tmpxft_000021ca_00000000-4_THCTensorMath.cpp4.ii". CMake Error at THC_generated_THCTensorMath.cu.o.Release.cmake:279 (message): Error generating file /home/lelu/torch/extra/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:3113: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o' failed make[2]: [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o] Error 1 make[2]: Bitmemiş işler için bekliyor....

lib/THC/CMakeFiles/THC.dir/build.make:4792: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathPairwise.cu.o' failed make[2]: [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathPairwise.cu.o] Error 1 CMakeFiles/Makefile2:177: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed make[1]: [lib/THC/CMakeFiles/THC.dir/all] Error 2 Makefile:129: recipe for target 'all' failed make: *** [all] Error 2

Error: Build error: Failed building.



@nagadomi I'm still having this issue. ;-;
nagadomi commented 5 years ago

@doneforaiur Try the following command before luarocks make..

$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
lh14576 commented 5 years ago

@nagadomi Thank you! This just saved my day! Was just about to return my RTX2070

fiandro commented 5 years ago

Thanks to @nagadomi for the help, I could successfully build luatorch against cuda 10 and cudnn7 on Ubuntu 16.04, RTX2080. I canonly add that it is not necessary to unistall the system-wide cmake, it is sufficient to build it from the sources on some user directory such as /home/user/myrepos/and then do export PATH="/home/user/myrepos/CMake/bin:${PATH}" before building luatorch.

nagadomi commented 5 years ago

@fiandro I merged changes for CUDA10 into a personal distro repo yesterday. It was not really necessary to install newest CMake, but a few lines of patches for cmake/3.6/Modules/FindCUDA.cmake. diff: https://github.com/nagadomi/distro/commit/946affd92c5818e880e414c915ce23e003b9aa9d#diff-9ea7fc52fa0c8de646449b3b8cda3ac9

Related to https://github.com/nagadomi/waifu2x/issues/253#issuecomment-445448928

AlekzNet commented 5 years ago

Does it work with gcc-8?

AlekzNet commented 5 years ago

Compilation fails with the following errors:

luarocks make rocks/cutorch-scm-1.rockspec                                                                                                     Warning: unmatched variable LUALIB  

jopts=$(getconf _NPROCESSORS_CONF)

echo "Building on $jopts cores"
cmake -E make_directory build && cd build && cmake .. -DLUALIB= -DLUA_INCDIR=/home/alekz/torch/install/include -DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS} -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/home/alekz/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/home/alekz/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$jopts install

Building on 8 cores
-- The C compiler identification is GNU 8.2.0
-- The CXX compiler identification is GNU 8.2.0
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Found Torch7 in /home/alekz/torch/install
-- Looking for pthread.h
-- Looking for pthread.h - found
-- Looking for pthread_kill
-- Looking for pthread_kill - not found
-- Looking for pthread_create in pthreads
-- Looking for pthread_create in pthreads - not found
-- Looking for pthread_create in pthread
-- Looking for pthread_create in pthread - found
-- Found Threads: TRUE  
-- Found CUDA: /usr/local/cuda (found suitable version "10.0", minimum required is "6.5") 
-- Removing -DNDEBUG from compile flags
-- TH_LIBRARIES: TH
-- MAGMA not found. Compiling without MAGMA support
-- Autodetected CUDA architecture(s):  5.2
-- got cuda version 10.0
-- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor
-- CUDA_NVCC_FLAGS: -D__CUDA_NO_HALF_OPERATORS__;-gencode;arch=compute_52,code=sm_52;-DCUDA_HAS_FP16=1
-- THC_SO_VERSION: 0
-- Performing Test HAS_LUAL_SETFUNCS
-- Performing Test HAS_LUAL_SETFUNCS - Failed
-- Configuring done
-- Generating done
-- Build files have been written to: /var/tmp/cutorch/build
[  2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o
[  2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o
[  3%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCBlas.cu.o
[  4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o
[  5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o
[  6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o
[  8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCSleep.cu.o
[  8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

6 errors detected in the compilation of "/tmp/tmpxft_00006cb1_00000000-6_THCSleep.cpp1.ii".
6 errors detected in the compilation of "/tmp/tmpxft_00006cb9_00000000-6_THCBlas.cpp1.ii".
CMake Error at THC_generated_THCSleep.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCSleep.cu.o

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:79: lib/THC/CMakeFiles/THC.dir/THC_generated_THCSleep.cu.o] Error 1
make[2]: *** Waiting for unfinished jobs....
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

CMake Error at THC_generated_THCBlas.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCBlas.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:72: lib/THC/CMakeFiles/THC.dir/THC_generated_THCBlas.cu.o] Error 1
/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

6 errors detected in the compilation of "/tmp/tmpxft_00006cca_00000000-6_THCReduceApplyUtils.cpp1.ii".
/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

CMake Error at THC_generated_THCReduceApplyUtils.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCReduceApplyUtils.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:65: lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o] Error 1
6 errors detected in the compilation of "/tmp/tmpxft_00006cd7_00000000-6_THCTensor.cpp1.ii".
CMake Error at THC_generated_THCTensor.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensor.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:100: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o] Error 1
6 errors detected in the compilation of "/tmp/tmpxft_00006ce2_00000000-6_THCStorageCopy.cpp1.ii".
CMake Error at THC_generated_THCStorageCopy.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCStorageCopy.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:93: lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o] Error 1
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

6 errors detected in the compilation of "/tmp/tmpxft_00006ce1_00000000-6_THCTensorCopy.cpp1.ii".
/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: type name is not allowed

/usr/include/c++/8/type_traits(1049): error: identifier "__is_assignable" is undefined

CMake Error at THC_generated_THCTensorCopy.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorCopy.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:107: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o] Error 1
/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

/usr/include/c++/8/utility(307): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: pack expansion does not make use of any argument packs

/usr/include/c++/8/utility(329): error: expected a ">"
          detected during instantiation of type "std::make_integer_sequence" 
(340): here

6 errors detected in the compilation of "/tmp/tmpxft_00006d00_00000000-6_THCHalf.cpp1.ii".
CMake Error at THC_generated_THCHalf.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCHalf.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:562: lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o] Error 1
6 errors detected in the compilation of "/tmp/tmpxft_00006d08_00000000-6_THCStorage.cpp1.ii".
CMake Error at THC_generated_THCStorage.cu.o.Release.cmake:279 (message):
  Error generating file
  /var/tmp/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCStorage.cu.o

make[2]: *** [lib/THC/CMakeFiles/THC.dir/build.make:86: lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o] Error 1
make[1]: *** [CMakeFiles/Makefile2:177: lib/THC/CMakeFiles/THC.dir/all] Error 2
make: *** [Makefile:130: all] Error 2

Error: Build error: Failed building.

Any ideas?

nagadomi commented 5 years ago

@AlekzNet gcc versions later than 7 are not supported on CUDA10. ax3l/CUDA_Compilers.md

AlekzNet commented 5 years ago

Mmm... I'm getting the same error with gcc-7.

nagadomi commented 5 years ago

Perhaps you need to delete CMakeCache.txt.

$ rm build/CMakeCache.txt                                      
$ CC=gcc-7 CXX=g++-7 luarocks make rocks/cutorch-scm-1.rockspec
AlekzNet commented 5 years ago

Yes! This is it! I thought I cleaned everything, but obviously not. Thanks much @nagadomi !

fiandro commented 5 years ago

Thanks a lot to @nagadomi for the updated repo: Since I had to write a complete howto to installing Torch on Ubuntu 16.04 on RTX equipped machines, I tought to share it with you:

**** Installing (Lua)Torch on Ubuntu 16.04.5 LTS (Updated as 27/12/2018)

This is a tutorial to installing Torch on Ubuntu 16.04 for use with the new NVIDIA RTX20x0 GPUs (older GPUs such as GTX10x0 shall be supported as well). This tutorial addresses the problem of Torch being in mainteinance mode, officially not being able to run on RTX GPUs due to a complex chain of dependencies (look at this thread for more information https://github.com/torch/cutorch/issues/834). This tutorial will guide you from installing the NVIDIA kernel drivers to getting a working Torch version on RTX NVIDIA GPUs. Notice that GCC v 5 or 6 is assumed to be the default compiler; if not, use update-alternatives to set the correct GCC/G++ pair version (see https://askubuntu.com/questions/26498/how-to-choose-the-default-gcc-and-g-version).

leoniescape commented 5 years ago

Try changing the gcc version to gcc-6. Worked for me!

It also works for me when I downgraded to gcc-6.5. Previously I was uisng gcc7 , I tried luarocks make cudnn-scm-1.rockspec, but cudnn can't be found when running torch. Now it works when I use gcc6.

xiaoshuliu commented 5 years ago

@nagadomi I successfully installed everything, but it cannot pass all the tests in test.sh.

nagadomi commented 5 years ago

@xiaoshuliu PReLU's test error is a known issue. I just ignore it because I do not use PReLU. I think it's just an issue with the test suite. https://github.com/torch/torch7/issues/1104 https://github.com/torch/cunn/issues/474 https://github.com/torch/distro/issues/225

xiaoshuliu commented 5 years ago

@xiaoshuliu PReLU's test error is a known issue. I just ignore it because I do not use PReLU. I think it's just an issue with the test suite. torch/torch7#1104 torch/cunn#474 torch/distro#225

@nagadomi Thank you! I have three ERRORs in PReLu (forward, back) and multinomial_alias, and a FAIL in VolumetricFullConvolution_pair_test. Do you know which package "VolumetricFullConvolution_pair_test" belongs to, and does it matter much?

multinomial_alias Function call failed /home/xiaoshul/torch/install/share/lua/5.1/torch/Tensor.lua:462: bad argument #1 to 'set' (expecting number or Tensor or Storage) stack traceback: [C]: in function 'set' /home/xiaoshul/torch/install/share/lua/5.1/torch/Tensor.lua:462: in function 'view' /home/xiaoshul/torch/install/share/lua/5.1/cutorch/test.lua:3264: in function 'v' /home/xiaoshul/torch/install/share/lua/5.1/cutorch/test.lua:4916: in function </home/xiaoshul/torch/install/share/lua/5.1/cutorch/test.lua:4914> [C]: in function 'xpcall' /home/xiaoshul/torch/install/share/lua/5.1/torch/Tester.lua:477: in function '_pcall' /home/xiaoshul/torch/install/share/lua/5.1/torch/Tester.lua:436: in function '_run' /home/xiaoshul/torch/install/share/lua/5.1/torch/Tester.lua:355: in function 'run' /home/xiaoshul/torch/install/share/lua/5.1/cutorch/test.lua:4934: in function 'test' [string "cutorch.test()"]:1: in main chunk [C]: in function 'pcall' ...shul/torch/install/lib/luarocks/rocks/trepl/scm-1/bin/th:136: in main chunk [C]: at 0x00405e90

VolumetricFullConvolution_pair_test error on state (backward) with torch.CudaHalfTensor LT failed: 0.1875 >= 0.11 /home/xiaoshul/torch/install/share/lua/5.1/cunn/test.lua:5686: in function 'v' /home/xiaoshul/torch/install/share/lua/5.1/cunn/test.lua:6670: in function </home/xiaoshul/torch/install/share/lua/5.1/cunn/test.lua:6668>

Follow up: Looks like it is located in cunn. For cuda 10, shall I install cunn using "luarocks install cunn"? Or use some other version instead?

QROST commented 5 years ago

I encountered an error The imported target “Qt5::Gui” references the file “/usr/lib/x86_64-linux-gnu/libEGL.so” but this file does not exist. while building and installing the new CMake.

I found the solution in the link below. https://askubuntu.com/questions/616065/the-imported-target-qt5gui-references-the-file-usr-lib-x86-64-linux-gnu-li

NiklasVJM commented 5 years ago

I've been trying to install "luarocks install cutorch", but it keeps failing at the same spot:

/home/testbench/Downloads/cutorch-master/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/tmp/tmpxft_0000737e_00000000-4_THCTensorScatterGather.cpp4.ii". CMake Error at THC_generated_THCTensorScatterGather.cu.o.Release.cmake:279 (message): Error generating file /home/testbench/Downloads/cutorch-master/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorScatterGather.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:8288: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorScatterGather.cu.o' failed make[2]: [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorScatterGather.cu.o] Error 1 make[2]: Waiting for unfinished jobs.... /home/testbench/Downloads/cutorch-master/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/tmp/tmpxft_00007386_00000000-4_THCTensorIndex.cpp4.ii". CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:279 (message): Error generating file /home/testbench/Downloads/cutorch-master/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:6883: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o' failed make[2]: [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1 CMakeFiles/Makefile2:168: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed make[1]: [lib/THC/CMakeFiles/THC.dir/all] Error 2 Makefile:129: recipe for target 'all' failed make: *** [all] Error 2

Error: Build error: Failed building.

I have successfully installed just about everything else, including CUDA 10.0, newer Cmake, "luarocks install cunn" -worked just fine, and I tried Nagadomi's tip:

$ rm build/CMakeCache.txt
$ CC=gcc-7 CXX=g++-7 luarocks make rocks/cutorch-scm-1.rockspec

no difference... Is there any other tips or tricks that you could suggest?

generated commented 5 years ago

sorry for this question, patch -p1 < atomic.patch outputs:

can't find file to patch at input line 5

any help appreciated

nagadomi commented 5 years ago

@generated The error is probably due to the wrong working directory (should run on ~/torch/extra/cutorch). I recommend that you simply install torch from my repo.

# rm -r ~/torch # delete old installation
git clone https://github.com/nagadomi/distro.git ~/torch --recursive
cd ~/torch
./install-deps
./install.sh
HaoDing123 commented 5 years ago

Hello,everyone! when I install cutorch,I meet some error. I have installed torch7+cuda10.0+cudnn7.4.1 in ubuntu16.04 with RTX2080( Driver Version: 410.78 ) I tried the command : dh@dh-MS-7B98:~/torch/extra/cutorch$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__" dh@dh-MS-7B98:~/torch/extra/cutorch$ luarocks make rocks/cutorch-scm-1.rockspecWarning: unmatched variable LUALIB

jopts=$(getconf _NPROCESSORS_CONF)

echo "Building on $jopts cores" cmake -E make_directory build && cd build && cmake .. -DLUALIB= -DLUA_INCDIR=/home/dh/torch/install/include -DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS} -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/home/dh/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/home/dh/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$jopts install

Building on 12 cores -- Found Torch7 in /home/dh/torch/install -- Removing -DNDEBUG from compile flags -- TH_LIBRARIES: TH -- MAGMA not found. Compiling without MAGMA support -- Autodetected CUDA architecture(s): 7.5 -- got cuda version 10.0 -- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor -- CUDA_NVCC_FLAGS: -D__CUDA_NO_HALF_OPERATORS__;-gencode;arch=compute_75,code=sm_75;-DCUDA_HAS_FP16=1 -- THC_SO_VERSION: 0 -- Configuring done -- Generating done -- Build files have been written to: /home/dh/torch/extra/cutorch/build [ 1%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o [ 2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o [ 3%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o [ 4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o [ 5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o [ 6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o [ 7%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath2.cu.o [ 8%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathBlas.cu.o [ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathMagma.cu.o [ 11%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathPairwise.cu.o [ 12%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathReduce.cu.o [ 13%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathScan.cu.o CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message): Removing non-existent dependency file: generic/THStorage.h CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message): Removing non-existent dependency file: generic/THStorageCopy.h CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message): Removing non-existent dependency file: generic/THTensor.h ...... ...... [ 18%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorTopK.cu.o CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message): Removing non-existent dependency file: generic/THStorage.h CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message): Removing non-existent dependency file: generic/THStorageCopy.h CMake Warning at /usr/share/cmake-3.5/Modules/FindCUDA/make2cmake.cmake:65 (message): Removing non-existent dependency file: generic/THTensor.h ...... ...... ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply1I19TensorAddConstantOpIhEhjLi2EEv10TensorInfoIT0_T1_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply1I19TensorAddConstantOpIhEhjLi1EEv10TensorInfoIT0_T1_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply1I19TensorAddConstantOpIhEhjLin2EEv10TensorInfoIT0_T1_ES4T is out of range. .minnctapersm will be ignored CMakeFiles/Makefile2:172: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed make[1]: [lib/THC/CMakeFiles/THC.dir/all] Error 2 Makefile:127: recipe for target 'all' failed make: [all] Error 2

Error: Build error: Failed building.

Can kindly people help me?

zhengqibin commented 5 years ago

sorry for this question, patch -p1 < atomic.patch outputs:

can't find file to patch at input line 5

any help appreciated first you should change the directory by: cd extra/cutorch

rehanpunjwani commented 4 years ago

I have done all the steps as above but when I rull install.sh I get the following error:

make[2]: No rule to make target 'exe/luajit-rocks/luajit-2.1/src/lib_ffi.c', needed by 'exe/luajit-rocks/luajit-2.1/vmdef.lua'. Stop. make[1]: [CMakeFiles/Makefile2:212: exe/luajit-rocks/luajit-2.1/CMakeFiles/luajit-static.dir/all] Error 2 make: *** [Makefile:130: all] Error 2 ./install.sh: line 107: /home/pi/torch/install/bin/luarocks: No such file or directory Installing common Lua packages ./install.sh: line 111: /home/pi/torch/install/bin/luarocks: No such file or directory Please help!!

Yijunmaverick commented 4 years ago

Many thanks with the guidance and I made it work with cuda 10. I have a question about how to install new packages (e.g., matio). It looks luarocks install XXX does not work, as you mentioned on installing cutorch.

nagadomi commented 4 years ago

@Yijunmaverick works for me.

% sudo apt-get install libmatio-dev
% luarocks install matio

test

% git clone https://github.com/soumith/matio-ffi.torch.git
% cd matio-ffi.torch/test 
% th test.lua 
(1,.,.) = 
   1   7  13  19
   3   9  15  21
   5  11  17  23

(2,.,.) = 
   2   8  14  20
   4  10  16  22
   6  12  18  24
[torch.DoubleTensor of size 2x3x4]

OK  
OK  
OK  
Could not find variable with name: b in file: test.mat  
OK  
OK  
OK  
OK  
OK  
OK  
Saving is OK!

I've heard that GFW blocks URLs about luarocks. If git clone can run normally, you can install it from local file.

% git clone https://github.com/soumith/matio-ffi.torch.git
% cd matio-ffi.torch
% luarocks make matio-scm-1.rockspec

rockspec index: https://github.com/torch/rocks

With libmatio4 env(Ubuntu 19.10), Reading string variable from mat file seems to fail (numeric variable is OK).

labbihi commented 4 years ago

I have this error after running : CC=gcc-7 CXX=g++-7 luarocks make rocks/cunn-scm-1.rockspec

[ 26%] Building NVCC (Device) object lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTableBag.cu.o /tmp/cunn/lib/THCUNN/LookupTable.cu(44): error: identifier "FULL_MASK" is undefined

/tmp/cunn/lib/THCUNN/LookupTable.cu(62): error: identifier "FULL_MASK" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(5): error: incomplete type is not allowed

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(5): error: identifier "LookupTable_accGradParameters" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(5): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(37): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: declaration is incompatible with function template "void cunn_LookupTable_accGradParametersKernelByFeature(long , Dtype , Dtype *, Dtype, ptrdiff_t, long, int)" /tmp/cunn/lib/THCUNN/LookupTable.cu(66): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(47): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: declaration is incompatible with "void THCudaHalfTensor_free(THCState , THCudaHalfTensor )" /root/torch/install/include/THC/generic/THCTensor.h(118): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: identifier "state" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: declaration is incompatible with "void __THCudaCheck(cudaError_t, const char *, int)" /root/torch/install/include/THC/THCGeneral.h(200): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(50): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(51): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: identifier "resize" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: variable "THCIndexTensor_" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: declaration is incompatible with "void THLongStorage_free(THLongStorage *)" /root/torch/install/include/TH/generic/THStorage.h(67): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: identifier "inputSize" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(61): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: namespace "thrust::cuda" has no actual member "par"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: identifier "thrustAlloc" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(86): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(87): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: expression preceding parentheses of apparent call must have (pointer-to-) function type

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "data" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "origIndices" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(91): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(93): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(127): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(128): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(131): error: variable "block" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(132): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(142): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: variable "THCudaHalfTensor_free" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: variable "__THCudaCheck" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(146): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(175): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: expression preceding parentheses of apparent call must have (pointer-to-) function type

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: identifier "idx" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(177): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(177): error: identifier "numel" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(177): error: no instance of overloaded function "thrust::unique" matches the argument list argument types are: (thrust::device_ptr<>, )

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(178): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(180): error: identifier "normType" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(183): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(37): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(38): error: variable "block" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: variable "cunn_LookupTable_accGradParametersKernelByFeature" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(40): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(47): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: declaration is incompatible with "void THCudaTensor_free(THCState , THCudaTensor )" /root/torch/install/include/THC/generic/THCTensor.h(118): here

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(48): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: variable "__THCudaCheck" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(49): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(50): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(51): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: variable "THCIndexTensor_" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(54): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: variable "THCIndexTensor_" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(55): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(56): error: variable "THLongStorage_free" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(61): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: namespace "thrust::cuda" has no actual member "par"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(83): error: expected a ";"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(86): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(87): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: variable "origIndices_data" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(90): error: expression preceding parentheses of apparent call must have (pointer-to-) function type

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(91): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(91): error: variable "count_data" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(93): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(127): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(128): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(131): error: variable "block" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(132): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(142): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: variable "THCudaTensor_free" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(144): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: this declaration has no storage class or type specifier

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: variable "__THCudaCheck" has already been defined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(145): error: expected a ")"

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(146): error: expected a declaration

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(175): warning: parsing restarts here after previous syntax error

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: identifier "THCIndex_t" is undefined

/tmp/cunn/lib/THCUNN/generic/LookupTable.cu(176): error: variable "idx_ptr" has already been defined

Error limit reached. 100 errors detected in the compilation of "/tmp/tmpxft_0000590c_00000000-13_LookupTable.compute_75.cpp1.ii". Compilation terminated. CMake Error at THCUNN_generated_LookupTable.cu.o.Release.cmake:280 (message): Error generating file /tmp/cunn/build/lib/THCUNN/CMakeFiles/THCUNN.dir//./THCUNN_generated_LookupTable.cu.o

lib/THCUNN/CMakeFiles/THCUNN.dir/build.make:8563: recipe for target 'lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTable.cu.o' failed make[2]: [lib/THCUNN/CMakeFiles/THCUNN.dir/THCUNN_generated_LookupTable.cu.o] Error 1 make[2]: Attente des tâches non terminées.... /tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddmLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddmLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLi1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIdEdddjLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddmLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddmLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLi1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIdEddjLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffmLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffmLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLi1ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELi1ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELin1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELi2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELi1EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorIfEfffjLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES2_IT1_S4_ES2_IT2_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffmLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffmLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLi1ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELin1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELi2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELi1EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorIfEffjLin2ELin2EEv10TensorInfoIT0_T2_ES2_IT1_S4_ES4T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_mLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_mLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin1ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi2ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLi1ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi1ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi1ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi1ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELi1ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin2ELin1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin2ELi2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin2ELi1EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply3I34logSigmoid_updateGradInput_functorI6halfES1_S1_S1_jLin2ELin2ELin2EEv10TensorInfoIT0_T3_ES3_IT1_S5_ES3_IT2_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_mLin1ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_mLin2ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin1ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin1ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin1ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin1ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi2ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi2ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi2ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi2ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi1ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi1ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi1ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLi1ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin2ELin1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin2ELi2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin2ELi1EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored ptxas warning : Value of threads per SM for entry _Z21kernelPointwiseApply2I31logSigmoid_updateOutput_functorI6halfES1_S1_jLin2ELin2EEv10TensorInfoIT0_T2_ES3_IT1_S5_ES5T is out of range. .minnctapersm will be ignored /tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

/tmp/cunn/lib/THCUNN/LookupTableBag.cu(18): warning: variable "MODE_SUM" was declared but never referenced

CMakeFiles/Makefile2:131: recipe for target 'lib/THCUNN/CMakeFiles/THCUNN.dir/all' failed make[1]: [lib/THCUNN/CMakeFiles/THCUNN.dir/all] Error 2 Makefile:149: recipe for target 'all' failed make: [all] Error 2

Error: Build error: Failed building.

airdine commented 4 years ago

same issue with RTX 2080 ti, using docker nvidia/cuda:10.0-cudnn7-devel-ubuntu16.04 cunn won't install with same output @labbihi

si someone have an idea?

mikeytag commented 4 years ago

Ok, I was having issues installing luarocks install cutorch and kept doing the stuff that @nagadomi recommends at the top of this thread.

However, I kept corrupting ~/torch/extras/cutorch because I kept applying the atomic.patch described, but I was already cloning torch from https://github.com/nagadomi/torch and his repo already has the patches!

If you're going to use the @nagadomi repo, here's how I got everything working:

cd ~
git clone https://github.com/nagadomi/distro.git ~/torch --recursive
cd ~/torch
./install-deps
./clean.sh
./update.sh
~/torch/install/bin/torch-activate
luarocks install nn
luarocks install torch
cd extras/cutorch
luarocks make rocks/cutorch-scm-1.rockspec
cd ~/torch/extra/cunn
luarocks make rocks/cunn-scm-1.rockspec
cd ~/torch/extra/cudnn
luarocks make cudnn-scm-1.rockspec

NOTE: I'm on Ubuntu 18.04 with CUDA 10.2

THANK YOU SO MUCH @nagadomi !

mherkazandjian commented 3 years ago

in case someone is interested, i put everything in a container

docker run --runtime=nvidia mherkazandjian/char-rnn:latest th train.lua -data_dir data/tinyshakespeare -rnn_size 512 -num_layers 2 -dropout 0.5

the dockerfile is here:

 https://github.com/mherkazandjian/docker-torch-rnn/blob/mher/char-rnn/CUDA/10.0-ubuntu18.04/Dockerfile

that includes a fix based on the dockerfile of https://github.com/crisbal/docker-torch-rnn/tree/master/CUDA/10.0-ubuntu18.04 with some mods to include char-rnn in the container

en1gma713 commented 3 years ago

I keep getting CMake Error: The following variables are used in this project, but they are set to NOTFOUND. Please set them or make sure they are set and tested correctly in the CMake files: CUDA_cublas_LIBRARY (ADVANCED) linked by target "THC" in directory /home/aiden/distro/extra/cutorch/lib/THC How do I fix this?

TingtingLiao commented 3 years ago

I keep getting CMake Error: The following variables are used in this project, but they are set to NOTFOUND. Please set them or make sure they are set and tested correctly in the CMake files: CUDA_cublas_LIBRARY (ADVANCED) linked by target "THC" in directory /home/aiden/distro/extra/cutorch/lib/THC How do I fix this?

Have you solved it? I have the same problem.

TatianaZobnina commented 3 years ago

on google colab after:

!export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__" !/content/torch/install/bin/luarocks make /content/torch/extra/cutorch/rocks/cutorch-scm-1.rockspec

got this error:

Warning: unmatched variable LUALIB

jopts=$(getconf _NPROCESSORS_CONF)

echo "Building on $jopts cores" cmake -E make_directory build && cd build && cmake .. -DLUALIB= -DLUA_INCDIR=/content/torch/install/include -DCMAKE_CXX_FLAGS=${CMAKE_CXX_FLAGS} -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH="/content/torch/install/bin/.." -DCMAKE_INSTALL_PREFIX="/content/torch/install/lib/luarocks/rocks/cutorch/scm-1" && make -j$jopts install

Building on 2 cores -- Found Torch7 in /content/torch/install -- Removing -DNDEBUG from compile flags -- TH_LIBRARIES: TH -- MAGMA not found. Compiling without MAGMA support -- Automatic GPU detection failed. Building for common architectures. -- Autodetected CUDA architecture(s): 3.0;3.5;5.0;5.2;6.0;6.1;7.0;7.0+PTX;7.5;7.5+PTX -- got cuda version 10.0 -- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor -- CUDA_NVCC_FLAGS: -gencode;arch=compute_30,code=sm_30;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_52,code=sm_52;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_70,code=compute_70;-gencode;arch=compute_75,code=compute_75;-DCUDA_HAS_FP16=1 -- THC_SO_VERSION: 0 -- Configuring done -- Generating done -- Build files have been written to: /content/torch/extra/cutorch/build [ 1%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o [ 2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMathBlas.cu.o /content/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(393): error: more than one operator "==" matches these operands: function "operator==(const half &, const half &)" function "operator==(half, half)" operand types are: half == half

/content/torch/extra/cutorch/lib/THC/generic/THCTensorMath.cu(414): error: more than one operator "==" matches these operands: function "operator==(const half &, const half &)" function "operator==(half, half)" operand types are: half == half

2 errors detected in the compilation of "/tmp/tmpxft_0001830f_00000000-13_THCTensorMath.compute_75.cpp1.ii". CMake Error at THC_generated_THCTensorMath.cu.o.cmake:267 (message): Error generating file /content/torch/extra/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorMath.cu.o

lib/THC/CMakeFiles/THC.dir/build.make:3113: recipe for target 'lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o' failed make[2]: [lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorMath.cu.o] Error 1 make[2]: Waiting for unfinished jobs.... CMakeFiles/Makefile2:177: recipe for target 'lib/THC/CMakeFiles/THC.dir/all' failed make[1]: [lib/THC/CMakeFiles/THC.dir/all] Error 2 Makefile:129: recipe for target 'all' failed make: [all] Error 2

Error: Build error: Failed building.

all other command works perfectly, except cutorch

lochotan4794 commented 3 years ago

@nagadomi Please help me. I follow your instruction but still get the error. -- Removing -DNDEBUG from compile flags -- TH_LIBRARIES: TH -- MAGMA not found. Compiling without MAGMA support -- Automatic GPU detection failed. Building for common architectures. -- Autodetected CUDA architecture(s): 3.5;5.0;5.3;6.0;6.1;7.0;7.5;7.5+PTX -- got cuda version 10.1 -- Found CUDA with FP16 support, compiling with torch.CudaHalfTensor -- CUDA_NVCC_FLAGS: -D__CUDA_NO_HALF_OPERATORS__;-gencode;arch=compute_35,code=sm_35;-gencode;arch=compute_50,code=sm_50;-gencode;arch=compute_53,code=sm_53;-gencode;arch=compute_60,code=sm_60;-gencode;arch=compute_61,code=sm_61;-gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_75,code=sm_75;-gencode;arch=compute_75,code=compute_75;-DCUDA_HAS_FP16=1 -- THC_SO_VERSION: 0 -- Performing Test HAS_LUAL_SETFUNCS -- Performing Test HAS_LUAL_SETFUNCS - Failed -- Configuring done -- Generating done -- Build files have been written to: /tmp/luarocks_cutorch-scm-1-7746/cutorch/build [ 1%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCBlas.cu.o [ 2%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o [ 3%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/generated/THC_generated_THCTensorSortShort.cu.o [ 4%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCSleep.cu.o [ 5%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCSortUtils.cu.o [ 6%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCReduceApplyUtils.cu.o [ 7%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o [ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensor.cu.o [ 10%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorConv.cu.o [ 11%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorageCopy.cu.o [ 12%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o [ 13%] Building NVCC (Device) object lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o /tmp/luarocks_cutorch-scm-1-7746/cutorch/lib/THC/THCAtomics.cuh(97): error: cannot overload functions distinguished by return type alone

1 error detected in the compilation of "/tmp/tmpxft_00014089_00000000-12_THCTensorIndex.compute_75.cpp1.ii". CMake Error at THC_generated_THCTensorIndex.cu.o.Release.cmake:280 (message): Error generating file /tmp/luarocks_cutorch-scm-1-7746/cutorch/build/lib/THC/CMakeFiles/THC.dir//./THC_generated_THCTensorIndex.cu.o

make[2]: [lib/THC/CMakeFiles/THC.dir/build.make:175: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorIndex.cu.o] Error 1 make[2]: Waiting for unfinished jobs.... ^Cmake[2]: [lib/THC/CMakeFiles/THC.dir/build.make:182: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorConv.cu.o] Interrupt make[2]: [lib/THC/CMakeFiles/THC.dir/build.make:119: lib/THC/CMakeFiles/THC.dir/THC_generated_THCTensorCopy.cu.o] Interrupt make[2]: [lib/THC/CMakeFiles/THC.dir/build.make:574: lib/THC/CMakeFiles/THC.dir/THC_generated_THCHalf.cu.o] Interrupt make[2]: [lib/THC/CMakeFiles/THC.dir/build.make:98: lib/THC/CMakeFiles/THC.dir/THC_generated_THCStorage.cu.o] Interrupt make[2]: [lib/THC/CMakeFiles/THC.dir/build.make:322: lib/THC/CMakeFiles/THC.dir/generated/THC_generated_THCTensorSortShort.cu.o] Interrupt make[1]: [CMakeFiles/Makefile2:167: lib/THC/CMakeFiles/THC.dir/all] Interrupt make: *** [Makefile:136: all] Interrupt

Error: Build error: Failed building.

qitop commented 2 years ago

EDIT: I have merged these changes to my distro repo. https://github.com/nagadomi/distro

Building Torch with CUDA10 has two problems.

  1. cmake/3.6/Modules/FindCUDA.cmake is outdated.
  2. atomicAdd(__half *address, __half val) is defined in /usr/local/cuda/include/cuda_fp16.h (duplicated).

How to build Torch with CUDA10

  1. Install the latest CMake from github repo (the latest FindCUDA.cmake will be installed)
$ sudo apt-get purge cmake
$ git clone https://github.com/Kitware/CMake.git
$ cd CMake
$ ./bootstrap; make; sudo make install
  1. Remove FindCUDA.cmake.
$ cd ~/torch
$ rm -fr cmake/3.6/Modules/FindCUDA*
  1. Apply the following patch to cutorch
diff --git a/lib/THC/THCAtomics.cuh b/lib/THC/THCAtomics.cuh
index 400875c..ccb7a1c 100644
--- a/lib/THC/THCAtomics.cuh
+++ b/lib/THC/THCAtomics.cuh
@@ -94,6 +94,7 @@ static inline __device__ void atomicAdd(long *address, long val) {
 }

 #ifdef CUDA_HALF_TENSOR
+#if !(__CUDA_ARCH__ >= 700 || !defined(__CUDA_ARCH__) )
 static inline  __device__ void atomicAdd(half *address, half val) {
   unsigned int * address_as_ui =
       (unsigned int *) ((char *)address - ((size_t)address & 2));
@@ -117,6 +118,7 @@ static inline  __device__ void atomicAdd(half *address, half val) {
    } while (assumed != old);
 }
 #endif
+#endif
$ cd extra/cutorch
$ cat > atomic.patch
<copy and paste the patch>
$ patch -p1 < atomic.patch
  1. Build
$ ./clean.sh
$ export TORCH_NVCC_FLAGS="-D__CUDA_NO_HALF_OPERATORS__"
$ ./install.sh

I was successfully build Torch with CUDA10 with the above steps.

I have followed all the steps. My system gets stuck when I type

$cat > atomic.patch
diff
> diff --git a/lib/THC/THCAtomics.cuh b/lib/THC/THCAtomics.cuh
> index 400875c..ccb7a1c 100644
> --- a/lib/THC/THCAtomics.cuh
> +++ b/lib/THC/THCAtomics.cuh
> @@ -94,6 +94,7 @@ static inline __device__ void atomicAdd(long *address, long val) {
>  }
>  
>  #ifdef CUDA_HALF_TENSOR
> +#if !(__CUDA_ARCH__ >= 700 || !defined(__CUDA_ARCH__) )
>  static inline  __device__ void atomicAdd(half *address, half val) {
>    unsigned int * address_as_ui =
>        (unsigned int *) ((char *)address - ((size_t)address & 2));
> @@ -117,6 +118,7 @@ static inline  __device__ void atomicAdd(half *address, half val) {
>     } while (assumed != old);
>  }
>  #endif
> +#endif
>  
mw66 commented 2 years ago

cd extras/cutorch

it should be:

cd extra/cutorch