SHI-Labs / NATTEN

Neighborhood Attention Extension. Bringing attention to a neighborhood near you!
https://shi-labs.com/natten/
Other
347 stars 27 forks source link

Failed to install natten on Python 3.9(Windows) #127

Closed QiumanZeng closed 1 day ago

QiumanZeng commented 4 months ago

Windows 10 my torch vision==2.2.2 cuda vison==11.5 while i am trying to install natten with code:pip install natten, it is keeping said: Building wheel for natten (setup.py) ... error error: subprocess-exited-with-error

× python setup.py bdist_wheel did not run successfully. │ exit code: 1 ╰─> [94 lines of output] Building NATTEN with CUDA 118 Building NATTEN for SM: 7.5 Number of workers: 4 running bdist_wheel running build running build_py creating build creating build\lib.win-amd64-cpython-39 creating build\lib.win-amd64-cpython-39\natten copying src\natten\context.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\flops.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\functional.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\na1d.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\na2d.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\na3d.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\natten1d.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\natten2d.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\natten3d.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\nested.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\ops.py -> build\lib.win-amd64-cpython-39\natten copying src\natten\types.py -> build\lib.win-amd64-cpython-39\natten copying src\natten__init.py -> build\lib.win-amd64-cpython-39\natten creating build\lib.win-amd64-cpython-39\natten\utils copying src\natten/utils\checks.py -> build\lib.win-amd64-cpython-39\natten/utils copying src\natten/utils\log.py -> build\lib.win-amd64-cpython-39\natten/utils copying src\natten/utils\misc.py -> build\lib.win-amd64-cpython-39\natten/utils copying src\natten/utils\tensor.py -> build\lib.win-amd64-cpython-39\natten/utils copying src\natten/utils\testing.py -> build\lib.win-amd64-cpython-39\natten/utils copying src\natten/utils\init.py -> build\lib.win-amd64-cpython-39\natten/utils creating build\lib.win-amd64-cpython-39\natten\autotuner copying src\natten/autotuner\fna_backward.py -> build\lib.win-amd64-cpython-39\natten/autotuner copying src\natten/autotuner\fna_forward.py -> build\lib.win-amd64-cpython-39\natten/autotuner copying src\natten/autotuner\misc.py -> build\lib.win-amd64-cpython-39\natten/autotuner copying src\natten/autotuner\init.py -> build\lib.win-amd64-cpython-39\natten/autotuner creating build\lib.win-amd64-cpython-39\natten\autotuner\configs copying src\natten/autotuner/configs\fna_backward_128x128.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs copying src\natten/autotuner/configs\fna_backward_128x64.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs copying src\natten/autotuner/configs\fna_backward_64x64.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs copying src\natten/autotuner/configs\fna_forward_32x128.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs copying src\natten/autotuner/configs\fna_forward_64x128.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs copying src\natten/autotuner/configs\fna_forward_64x64.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs copying src\natten/autotuner/configs\init.py -> build\lib.win-amd64-cpython-39\natten/autotuner/configs running build_ext Current arch list: [75] (max: 75) CMake Error: CMake was unable to find a build program corresponding to "Ninja". CMAKE_MAKE_PROGRAM is not set. You probably need to select a different build tool. CMake Error: CMAKE_CXX_COMPILER not set, after EnableLanguage CMake Error: CMAKE_CUDA_COMPILER not set, after EnableLanguage -- Configuring incomplete, errors occurred! Traceback (most recent call last): File "", line 2, in File "", line 34, in File "C:\Users\manye\AppData\Local\Temp\pip-install-idhyanei\natten_1b47b9d9376a4b0eb96a12681a16f8ba\setup.py", line 243, in setup( File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools\init__.py", line 107, in setup return distutils.core.setup(**attrs) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\core.py", line 185, in setup return run_commands(dist) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\core.py", line 201, in run_commands dist.run_commands() File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\dist.py", line 969, in run_commands self.run_command(cmd) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools\dist.py", line 1234, in run_command super().run_command(command) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\dist.py", line 988, in run_command cmd_obj.run() File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\wheel\bdist_wheel.py", line 364, in run self.run_command("build") File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\cmd.py", line 318, in run_command self.distribution.run_command(command) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools\dist.py", line 1234, in run_command super().run_command(command) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\dist.py", line 988, in run_command cmd_obj.run() File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\command\build.py", line 131, in run self.run_command(cmd_name) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\cmd.py", line 318, in run_command self.distribution.run_command(command) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools\dist.py", line 1234, in run_command super().run_command(command) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\dist.py", line 988, in run_command cmd_obj.run() File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools\command\build_ext.py", line 84, in run _build_ext.run(self) File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\command\build_ext.py", line 345, in run self.build_extensions() File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\command\build_ext.py", line 467, in build_extensions self._build_extensions_serial() File "C:\Users\manye.conda\envs\swinfuse\lib\site-packages\setuptools_distutils\command\build_ext.py", line 493, in _build_extensions_serial self.build_extension(ext) File "C:\Users\manye\AppData\Local\Temp\pip-install-idhyanei\natten_1b47b9d9376a4b0eb96a12681a16f8ba\setup.py", line 219, in build_extension subprocess.check_call( File "C:\Users\manye.conda\envs\swinfuse\lib\subprocess.py", line 373, in check_call raise CalledProcessError(retcode, cmd) subprocess.CalledProcessError: Command '['cmake', 'C:\Users\manye\AppData\Local\Temp\pip-install-idhyanei\natten_1b47b9d9376a4b0eb96a12681a16f8ba\csrc', '-DPYTHON_PATH=C:\Users\manye\.conda\envs\swinfuse\python.exe', '-DOUTPUT_FILE_NAME=natten\libnatten.cp39-win_amd64', '-DNATTEN_CUDA_ARCH_LIST=75-real', '-DNATTEN_IS_WINDOWS=1', '-DNATTEN_IS_MAC=0', '-DIS_LIBTORCH_BUILT_WITH_CXX11_ABI=0', '-DNATTEN_WITH_AVX=1', '-DNATTEN_WITH_CUDA=1', '-DNATTEN_WITH_CUTLASS=1', '-DPY_LIB_DIR=C:\Users\manye\.conda\envs\swinfuse\libs', '-G Ninja', '-DCMAKE_BUILD_TYPE=Release']' returned non-zero exit status 1. [end of output]

note: This error originates from a subprocess, and is likely not a problem with pip. ERROR: Failed building wheel for natten Running setup.py clean for natten Failed to build natten ERROR: Could not build wheels for natten, which is required to install pyproject.toml-based projects

how can i solve this? please help me

alihassanijr commented 4 months ago

Could you check if you have cmake installed via pip, and if so, uninstall it?

pip3 uninstall cmake

It looks like cmake isn't finding the ninja build system. The only other explanation could be an incomplete Visual Studio install. Could kindly try running the repair tool and see if there's anything missing?

justin4ai commented 4 months ago

The same error occurs in Linux! Any updates other than uninstalling cmake?

alihassanijr commented 4 months ago

@justin4ai Could you please share the full error? I can't imagine why this would be an issue on Linux.

justin4ai commented 4 months ago

@alihassanijr Thank you for helping me out! the full error is crazily long.. so I only attach some of it - still it is very long.

BlockSizeI, kBlockSizeJ, kMaxK, kAllowDeltaCompute>::OutputFragments &, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim, CausalMask, scalart, ArchTag, kIsAligned, kBlockSizeI, kBlockSizeJ, kMaxK, kAllowDeltaCompute>::Params &, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim, CausalMask, scalart, ArchTag, kIsAligned, kBlockSizeI, kBlockSizeJ, kMaxK, kAllowDeltaCompute>::Dim, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim, CausalMask, scalart, ArchTag, kIsAligned, kBlockSizeI, kBlockSizeJ, kMaxK_, kAllowDeltaCompute>::Dim, __nv_bool, __nv_bool, uint8_t, uint8t) [with NADim=3, CausalMask=natten::cuda::fna::CausalMask<false, false, false>, scalart=cutlass::bfloat16t, ArchTag=cutlass::arch::Sm80, kIsAligned=true, kBlockSizeI=128, kBlockSizeJ=64, kMaxK=65536, kAllowDeltaCompute=false]" /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/kernelbackward.h(1449): here instantiation of "void natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim, CausalMask, scalart, ArchTag, kIsAligned, kBlockSizeI, kBlockSizeJ, kMaxK_, kAllowDeltaCompute>::attentionkernel(natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim, CausalMask, scalart, ArchTag, kIsAligned, kBlockSizeI, kBlockSizeJ, kMaxK, kAllowDeltaCompute>::Params) [with NADim=3, CausalMask=natten::cuda::fna::CausalMask<false, false, false>, scalart=cutlass::bfloat16t, ArchTag=cutlass::arch::Sm80, kIsAligned=true, kBlockSizeI=128, kBlockSizeJ=64, kMaxK=65536, kAllowDeltaCompute=false]" /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/fna/source_30.cu(651): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/gemm/custom_mma_multistage.h(259): warning: missing return statement at end of non-void function "cutlass::gemm::threadblock::CustomMmaMultistage<Shape_, IteratorA_, SmemIteratorA_, CacheOpA, IteratorB_, SmemIteratorB_, CacheOpB, ElementC_, LayoutC_, Policy_, Stages, SharedMemoryClear, kMaxK, Enable>::set_zero_outside_bounds [with Shape_=cutlass::gemm::GemmShape<128, 64, 32>, IteratorA_=cutlass::transform::threadblock::CustomPredicatedTileAccessIterator<3, cutlass::MatrixShape<128, 32>, cutlass::bfloat16_t, cutlass::layout::RowMajor, 1, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 128>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, cutlass::Array<cutlass::bfloat16_t, 8, false>>, SmemIteratorA_=cutlass::transform::threadblock::RegularTileAccessIterator<cutlass::MatrixShape<128, 32>, cutlass::bfloat16_t, cutlass::layout::RowMajorTensorOpMultiplicandCrosswise<16, 32>, 0, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 128>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, 16>, CacheOpA=cutlass::arch::CacheOperation::Global, IteratorB_=cutlass::transform::threadblock::CustomPredicatedTileAccessIterator<3, cutlass::MatrixShape<32, 64>, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, 0, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 64>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, cutlass::Array<cutlass::bfloat16_t, 8, false>>, SmemIteratorB_=cutlass::transform::threadblock::RegularTileAccessIterator<cutlass::MatrixShape<32, 64>, cutlass::bfloat16_t, cutlass::layout::ColumnMajorTensorOpMultiplicandCrosswise<16, 32>, 1, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 64>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, 16>, CacheOpB=cutlass::arch::CacheOperation::Global, ElementC_=float, LayoutC_=cutlass::layout::RowMajor, Policy_=cutlass::gemm::threadblock::MmaPolicy<cutlass::gemm::warp::MmaTensorOp<cutlass::gemm::GemmShape<32, 32, 32>, cutlass::bfloat16_t, cutlass::layout::RowMajorTensorOpMultiplicandCrosswise<16, 32>, cutlass::bfloat16_t, cutlass::layout::ColumnMajorTensorOpMultiplicandCrosswise<16, 32>, float, cutlass::layout::RowMajor, cutlass::gemm::warp::MmaTensorOpPolicy<cutlass::arch::Mma<cutlass::gemm::GemmShape<16, 8, 8>, 32, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, float, cutlass::layout::RowMajor, cutlass::arch::OpMultiplyAdd>, cutlass::MatrixShape<1, 1>>, 1, false, __nv_bool>, cutlass::MatrixShape<0, 0>, cutlass::MatrixShape<0, 0>, 1>, Stages=3, SharedMemoryClear=cutlass::gemm::SharedMemoryClearOption::kNone, kMaxK=65536, Enable=__nv_bool]"
            detected during:
              instantiation of "__nv_bool cutlass::gemm::threadblock::CustomMmaMultistage<Shape_, IteratorA_, SmemIteratorA_, CacheOpA, IteratorB_, SmemIteratorB_, CacheOpB, ElementC_, LayoutC_, Policy_, Stages, SharedMemoryClear, kMaxK, Enable>::set_zero_outside_bounds(__nv_bool) [with Shape_=cutlass::gemm::GemmShape<128, 64, 32>, IteratorA_=cutlass::transform::threadblock::CustomPredicatedTileAccessIterator<3, cutlass::MatrixShape<128, 32>, cutlass::bfloat16_t, cutlass::layout::RowMajor, 1, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 128>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, cutlass::Array<cutlass::bfloat16_t, 8, false>>, SmemIteratorA_=cutlass::transform::threadblock::RegularTileAccessIterator<cutlass::MatrixShape<128, 32>, cutlass::bfloat16_t, cutlass::layout::RowMajorTensorOpMultiplicandCrosswise<16, 32>, 0, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 128>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, 16>, CacheOpA=cutlass::arch::CacheOperation::Global, IteratorB_=cutlass::transform::threadblock::CustomPredicatedTileAccessIterator<3, cutlass::MatrixShape<32, 64>, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, 0, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 64>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, cutlass::Array<cutlass::bfloat16_t, 8, false>>, SmemIteratorB_=cutlass::transform::threadblock::RegularTileAccessIterator<cutlass::MatrixShape<32, 64>, cutlass::bfloat16_t, cutlass::layout::ColumnMajorTensorOpMultiplicandCrosswise<16, 32>, 1, cutlass::transform::PitchLinearWarpRakedThreadMap<cutlass::layout::PitchLinearShape<32, 64>, 256, cutlass::layout::PitchLinearShape<4, 8>, 8>, 16>, CacheOpB=cutlass::arch::CacheOperation::Global, ElementC_=float, LayoutC_=cutlass::layout::RowMajor, Policy_=cutlass::gemm::threadblock::MmaPolicy<cutlass::gemm::warp::MmaTensorOp<cutlass::gemm::GemmShape<32, 32, 32>, cutlass::bfloat16_t, cutlass::layout::RowMajorTensorOpMultiplicandCrosswise<16, 32>, cutlass::bfloat16_t, cutlass::layout::ColumnMajorTensorOpMultiplicandCrosswise<16, 32>, float, cutlass::layout::RowMajor, cutlass::gemm::warp::MmaTensorOpPolicy<cutlass::arch::Mma<cutlass::gemm::GemmShape<16, 8, 8>, 32, cutlass::bfloat16_t, cutlass::layout::RowMajor, cutlass::bfloat16_t, cutlass::layout::ColumnMajor, float, cutlass::layout::RowMajor, cutlass::arch::OpMultiplyAdd>, cutlass::MatrixShape<1, 1>>, 1, false, __nv_bool>, cutlass::MatrixShape<0, 0>, cutlass::MatrixShape<0, 0>, 1>, Stages=3, SharedMemoryClear=cutlass::gemm::SharedMemoryClearOption::kNone, kMaxK=65536, Enable=__nv_bool]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/kernel_backward.h(1983): here
              instantiation of "void natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::processBlockIJ(natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::SharedStorage &, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::OutputFragments &, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::Params &, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::Dim, natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::Dim, __nv_bool, __nv_bool, uint8_t, uint8_t) [with NADim_=3, CausalMask=natten::cuda::fna::CausalMask<false, false, false>, scalar_t_=cutlass::bfloat16_t, ArchTag_=cutlass::arch::Sm80, kIsAligned_=true, kBlockSizeI_=128, kBlockSizeJ_=64, kMaxK_=65536, kAllowDeltaCompute=false]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/kernel_backward.h(1449): here
              instantiation of "void natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::attention_kernel(natten::cuda::fna::FusedNeighborhoodAttentionBackwardKernel<NADim_, CausalMask, scalar_t_, ArchTag_, kIsAligned_, kBlockSizeI_, kBlockSizeJ_, kMaxK_, kAllowDeltaCompute>::Params) [with NADim_=3, CausalMask=natten::cuda::fna::CausalMask<false, false, false>, scalar_t_=cutlass::bfloat16_t, ArchTag_=cutlass::arch::Sm80, kIsAligned_=true, kBlockSizeI_=128, kBlockSizeJ_=64, kMaxK_=65536, kAllowDeltaCompute=false]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/fna/source_30.cu(651): here

  [ 28%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_31.cu.o
  [ 29%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_4.cu.o
  [ 30%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_5.cu.o
  [ 31%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_6.cu.o
  [ 32%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_7.cu.o
  [ 33%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_8.cu.o
  [ 34%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/fna/source_9.cu.o
  [ 35%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/1d/sm70/source_0.cu.o
  [ 36%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/1d/sm70/source_1.cu.o
  [ 37%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/1d/sm75/source_0.cu.o
  [ 38%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/1d/sm75/source_1.cu.o
  [ 39%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/1d/sm80/source_0.cu.o
  [ 40%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/1d/sm80/source_1.cu.o
  [ 41%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm70/source_0.cu.o
  [ 42%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm70/source_1.cu.o
  [ 43%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm70/source_2.cu.o
  [ 44%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm70/source_3.cu.o
  [ 45%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm75/source_0.cu.o
  [ 46%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm75/source_1.cu.o
  [ 47%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm75/source_2.cu.o
  [ 48%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm75/source_3.cu.o
  [ 50%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_0.cu.o
  [ 51%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_1.cu.o
  [ 52%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_10.cu.o
  [ 53%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_11.cu.o
  [ 54%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_12.cu.o
  [ 55%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_13.cu.o
  [ 56%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_14.cu.o
  [ 57%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_15.cu.o
  [ 58%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_16.cu.o
  [ 59%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_17.cu.o
  [ 60%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_18.cu.o
  [ 61%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_19.cu.o
  [ 62%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_2.cu.o
  [ 63%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_20.cu.o
  [ 64%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_21.cu.o
  [ 65%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_22.cu.o
  [ 66%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_23.cu.o
  [ 67%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_24.cu.o
  [ 68%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_25.cu.o
  [ 69%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_26.cu.o
  [ 70%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_27.cu.o
  [ 71%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_28.cu.o
  [ 72%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_29.cu.o
  [ 73%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_3.cu.o
  [ 74%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_30.cu.o
  [ 75%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_31.cu.o
  [ 76%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_4.cu.o
  [ 77%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_5.cu.o
  [ 78%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_6.cu.o
  [ 79%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_7.cu.o
  [ 80%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_8.cu.o
  [ 81%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/gemm/2d/sm80/source_9.cu.o
  [ 82%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/naive/source_0.cu.o
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(311): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_start [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_start(int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/pointwise_neighborhood_1d.cuh(177): here
              instantiation of "void natten::cuda::naive::PointwiseNeighborhood1DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::PointwiseNeighborhood1DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::PointwiseNeighborhood1DFull<natten::float64, natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/pointwise_neighborhood_1d.cuh(323): here
              instantiation of "void natten::cuda::naive::PointwiseNeighborhood1D<Args_>::operator()(int32_t, cudaStream_t, __nv_bool, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with Args_=natten::naive::ArgumentPack1D<natten::float64, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_0.cu(57): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(319): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_end [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_end(int32_t, int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/pointwise_neighborhood_1d.cuh(178): here
              instantiation of "void natten::cuda::naive::PointwiseNeighborhood1DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::PointwiseNeighborhood1DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::PointwiseNeighborhood1DFull<natten::float64, natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/pointwise_neighborhood_1d.cuh(323): here
              instantiation of "void natten::cuda::naive::PointwiseNeighborhood1D<Args_>::operator()(int32_t, cudaStream_t, __nv_bool, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with Args_=natten::naive::ArgumentPack1D<natten::float64, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_0.cu(57): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(338): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_pb_start [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_pb_start(int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/pointwise_neighborhood_1d.cuh(191): here
              instantiation of "void natten::cuda::naive::PointwiseNeighborhood1DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::PointwiseNeighborhood1DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::PointwiseNeighborhood1DFull<natten::float64, natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/pointwise_neighborhood_1d.cuh(323): here
              instantiation of "void natten::cuda::naive::PointwiseNeighborhood1D<Args_>::operator()(int32_t, cudaStream_t, __nv_bool, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with Args_=natten::naive::ArgumentPack1D<natten::float64, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_0.cu(57): here

  [ 83%] Building CUDA object CMakeFiles/natten.dir/autogen/src/cuda/naive/source_1.cu.o
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(311): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_start [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_start(int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/neighborhood_neighborhood_3d.cuh(163): here
              instantiation of "void natten::cuda::naive::NeighborhoodNeighborhood3DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::NeighborhoodNeighborhood3DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack3D<natten::float64, true, false, false>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::NeighborhoodNeighborhood3DFull<natten::float64, natten::naive::ArgumentPack3D<natten::float64, true, false, false>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/neighborhood_neighborhood_3d.cuh(328): here
              instantiation of "void natten::cuda::naive::NeighborhoodNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float64, true, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(38): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(319): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_end [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_window_end(int32_t, int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/neighborhood_neighborhood_3d.cuh(166): here
              instantiation of "void natten::cuda::naive::NeighborhoodNeighborhood3DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::NeighborhoodNeighborhood3DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack3D<natten::float64, true, false, false>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::NeighborhoodNeighborhood3DFull<natten::float64, natten::naive::ArgumentPack3D<natten::float64, true, false, false>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/neighborhood_neighborhood_3d.cuh(328): here
              instantiation of "void natten::cuda::naive::NeighborhoodNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float64, true, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(38): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(276): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_backward_window_start [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_backward_window_start(int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_1d.cuh(131): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood1DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood1DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood1DFull<natten::float64, natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_1d.cuh(251): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood1D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with Args_=natten::naive::ArgumentPack1D<natten::float64, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(912): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(286): warning: missing return statement at end of non-void function "natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_backward_window_end [with IsCausal_=true]"
            detected during:
              instantiation of "int32_t natten::cuda::naive::NeighborhoodMask<IsCausal_>::get_backward_window_end(int32_t) [with IsCausal_=true]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_1d.cuh(132): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood1DFull<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood1DFull<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float64, CausalMask_=natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask]"
  (79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood1DFull<natten::float64, natten::naive::ArgumentPack1D<natten::float64, true>::CausalMask>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_1d.cuh(251): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood1D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with Args_=natten::naive::ArgumentPack1D<natten::float64, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(912): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack2D<natten::float16, false, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::float16, natten::naive::ArgumentPack2D<natten::float16, false, false>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::float16, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1294): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack2D<natten::float16, false, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::float16, natten::naive::ArgumentPack2D<natten::float16, false, true>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::float16, false, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1326): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack2D<natten::float16, true, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::float16, natten::naive::ArgumentPack2D<natten::float16, true, false>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::float16, true, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1358): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack2D<natten::float16, true, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::float16, natten::naive::ArgumentPack2D<natten::float16, true, true>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::float16, true, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1390): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack2D<natten::bfloat16, false, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::bfloat16, natten::naive::ArgumentPack2D<natten::bfloat16, false, false>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::bfloat16, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1422): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack2D<natten::bfloat16, false, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::bfloat16, natten::naive::ArgumentPack2D<natten::bfloat16, false, true>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::bfloat16, false, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1454): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack2D<natten::bfloat16, true, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::bfloat16, natten::naive::ArgumentPack2D<natten::bfloat16, true, false>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::bfloat16, true, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1486): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_2d.cuh(232): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood2DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack2D<natten::bfloat16, true, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood2DHalf<natten::bfloat16, natten::naive::ArgumentPack2D<natten::bfloat16, true, true>::CausalMask>]"
  (294): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood2D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t> &, const std::tuple<int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack2D<natten::bfloat16, true, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1518): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, false, false, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, false, false, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, false, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(1968): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, false, false, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, false, false, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, false, false, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2002): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, false, true, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, false, true, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, false, true, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2036): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, false, true, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, false, true, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, false, true, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2070): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, true, false, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, true, false, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, true, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2104): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, true, false, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, true, false, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, true, false, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2138): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, true, true, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, true, true, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, true, true, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2172): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::float16, CausalMask_=natten::naive::ArgumentPack3D<natten::float16, true, true, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::float16, natten::naive::ArgumentPack3D<natten::float16, true, true, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::float16, true, true, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2206): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, false, false, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, false, false, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, false, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2240): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, false, false, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, false, false, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, false, false, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2274): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, false, true, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, false, true, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, false, true, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2308): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, false, true, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, false, true, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, false, true, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2342): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, true, false, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, true, false, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, true, false, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2376): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, true, false, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, true, false, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, true, false, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2410): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, true, true, false>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, true, true, false>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, true, true, false>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2444): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/inverse_neighborhood_3d.cuh(264): warning: variable "weightsIndex" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::launch(natten::cuda::naive::InverseNeighborhood3DHalf<scalar_t, CausalMask_>::Params) [with scalar_t=natten::bfloat16, CausalMask_=natten::naive::ArgumentPack3D<natten::bfloat16, true, true, true>::CausalMask]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/naive/natten_commons.cuh(79): here
              instantiation of "void natten::cuda::naive::launch_cuda_kernel<KernelTemplate>(KernelTemplate::Params) [with KernelTemplate=natten::cuda::naive::InverseNeighborhood3DHalf<natten::bfloat16, natten::naive::ArgumentPack3D<natten::bfloat16, true, true, true>::CausalMask>]"
  (335): here
              instantiation of "void natten::cuda::naive::InverseNeighborhood3D<Args_>::operator()(int32_t, cudaStream_t, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, int32_t, int64_t, int64_t, int64_t, int64_t, int64_t, const std::tuple<int32_t, int32_t, int32_t> &, const std::tuple<int32_t, int32_t, int32_t> &) [with Args_=natten::naive::ArgumentPack3D<natten::bfloat16, true, true, true>]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/autogen/src/cuda/naive/source_1.cu(2478): here

  [ 84%] Building CXX object CMakeFiles/natten.dir/natten.cpp.o
  [ 85%] Building CXX object CMakeFiles/natten.dir/src/config.cpp.o
  [ 86%] Building CUDA object CMakeFiles/natten.dir/src/cuda/fna/clear_workspace.cu.o
  [ 87%] Building CXX object CMakeFiles/natten.dir/src/pytorch/compute_delta.cpp.o
  [ 88%] Building CXX object CMakeFiles/natten.dir/src/pytorch/cpu/compute_delta.cpp.o
  [ 89%] Building CXX object CMakeFiles/natten.dir/src/pytorch/cpu/na1d.cpp.o
  [ 90%] Building CXX object CMakeFiles/natten.dir/src/pytorch/cpu/na2d.cpp.o
  [ 91%] Building CXX object CMakeFiles/natten.dir/src/pytorch/cpu/na3d.cpp.o
  [ 92%] Building CUDA object CMakeFiles/natten.dir/src/pytorch/cuda/compute_delta.cu.o
  [ 93%] Building CUDA object CMakeFiles/natten.dir/src/pytorch/cuda/na1d.cu.o
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h: In constructor ‘torch::jit::slot_iterator_impl<Policy>::slot_iterator_impl(torch::jit::Module, bool, bool)’:
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h:409:113: error: use of deleted function ‘torch::jit::Module::Module(torch::jit::Module&&)’
    409 |       bool return_module) // if true include root itself as the first thing
        |                                                                                                                 ^
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h:93:1: note: declared here
     93 |   Module(Module&&) noexcept = default;
        | ^ ~~~~
  [ 94%] Building CUDA object CMakeFiles/natten.dir/src/pytorch/cuda/na2d.cu.o
  make[2]: *** [CMakeFiles/natten.dir/build.make:1418: CMakeFiles/natten.dir/src/pytorch/cuda/compute_delta.cu.o] Error 1
  make[2]: *** Waiting for unfinished jobs....
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/fna_forward.cuh(87): warning: variable "has_rpb" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::fna::fna_forward_generic<T,IntTuple,BooleanTuple,MemoryAllocator>(int, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, int32_t, IntTuple, int32_t, int32_t, int32_t, IntTuple, IntTuple, BooleanTuple, float, void *, IntTuple, IntTuple) [with T=natten::float64, IntTuple=std::tuple<int32_t>, BooleanTuple=std::tuple<__nv_bool>, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/na1d.cuh(86): here
              instantiation of "void natten::cuda::na1d_forward<T,MemoryAllocator>(int32_t, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &, const std::tuple<__nv_bool> &, float, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with T=natten::float64, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/src/pytorch/cuda/na1d.cu(72): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/fna_forward.cuh(90): warning: variable "launchKernel" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::fna::fna_forward_generic<T,IntTuple,BooleanTuple,MemoryAllocator>(int, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, int32_t, IntTuple, int32_t, int32_t, int32_t, IntTuple, IntTuple, BooleanTuple, float, void *, IntTuple, IntTuple) [with T=natten::float64, IntTuple=std::tuple<int32_t>, BooleanTuple=std::tuple<__nv_bool>, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/na1d.cuh(86): here
              instantiation of "void natten::cuda::na1d_forward<T,MemoryAllocator>(int32_t, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &, const std::tuple<__nv_bool> &, float, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with T=natten::float64, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/src/pytorch/cuda/na1d.cu(72): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/fna_backward.cuh(99): warning: variable "launchKernel" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::fna::fna_backward_generic<T,IntTuple,BooleanTuple,MemoryAllocator>(int, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, void *, void *, void *, void *, int32_t, IntTuple, int32_t, int32_t, int32_t, IntTuple, IntTuple, BooleanTuple, float, IntTuple, IntTuple, IntTuple) [with T=natten::float64, IntTuple=std::tuple<int32_t>, BooleanTuple=std::tuple<__nv_bool>, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/na1d.cuh(155): here
              instantiation of "void natten::cuda::na1d_backward<T,MemoryAllocator>(int32_t, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, void *, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, const std::tuple<int32_t> &, const std::tuple<int32_t> &, const std::tuple<__nv_bool> &, float, const std::tuple<int32_t> &, const std::tuple<int32_t> &, const std::tuple<int32_t> &) [with T=natten::float64, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/src/pytorch/cuda/na1d.cu(153): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/fna_forward.cuh(87): warning: variable "has_rpb" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::fna::fna_forward_generic<T,IntTuple,BooleanTuple,MemoryAllocator>(int, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, int32_t, IntTuple, int32_t, int32_t, int32_t, IntTuple, IntTuple, BooleanTuple, float, void *, IntTuple, IntTuple) [with T=natten::float64, IntTuple=std::tuple<int, int>, BooleanTuple=std::tuple<__nv_bool, __nv_bool>, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/na2d.cuh(86): here
              instantiation of "void natten::cuda::na2d_forward<T,MemoryAllocator>(int32_t, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, const std::tuple<int, int> &, const std::tuple<int, int> &, const std::tuple<__nv_bool, __nv_bool> &, float, const std::tuple<int, int> &, const std::tuple<int, int> &) [with T=natten::float64, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/src/pytorch/cuda/na2d.cu(73): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/fna_forward.cuh(90): warning: variable "launchKernel" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::fna::fna_forward_generic<T,IntTuple,BooleanTuple,MemoryAllocator>(int, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, int32_t, IntTuple, int32_t, int32_t, int32_t, IntTuple, IntTuple, BooleanTuple, float, void *, IntTuple, IntTuple) [with T=natten::float64, IntTuple=std::tuple<int, int>, BooleanTuple=std::tuple<__nv_bool, __nv_bool>, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/na2d.cuh(86): here
              instantiation of "void natten::cuda::na2d_forward<T,MemoryAllocator>(int32_t, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, const std::tuple<int, int> &, const std::tuple<int, int> &, const std::tuple<__nv_bool, __nv_bool> &, float, const std::tuple<int, int> &, const std::tuple<int, int> &) [with T=natten::float64, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/src/pytorch/cuda/na2d.cu(73): here

  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/fna/fna_backward.cuh(99): warning: variable "launchKernel" was declared but never referenced
            detected during:
              instantiation of "void natten::cuda::fna::fna_backward_generic<T,IntTuple,BooleanTuple,MemoryAllocator>(int, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, void *, void *, void *, void *, int32_t, IntTuple, int32_t, int32_t, int32_t, IntTuple, IntTuple, BooleanTuple, float, IntTuple, IntTuple, IntTuple) [with T=natten::float64, IntTuple=std::tuple<int, int>, BooleanTuple=std::tuple<__nv_bool, __nv_bool>, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/./include/natten/cuda/na2d.cuh(156): here
              instantiation of "void natten::cuda::na2d_backward<T,MemoryAllocator>(int32_t, size_t, cudaStream_t, MemoryAllocator, void *, void *, void *, void *, void *, void *, void *, void *, void *, void *, int32_t, int32_t, int32_t, int32_t, int32_t, const std::tuple<int, int> &, const std::tuple<int, int> &, const std::tuple<__nv_bool, __nv_bool> &, float, const std::tuple<int, int> &, const std::tuple<int, int> &, const std::tuple<int, int> &) [with T=natten::float64, MemoryAllocator=lambda [](void **, int64_t, __nv_bool)->void]"
  /tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/csrc/src/pytorch/cuda/na2d.cu(157): here

  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h: In constructor ‘torch::jit::slot_iterator_impl<Policy>::slot_iterator_impl(torch::jit::Module, bool, bool)’:
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h:409:113: error: use of deleted function ‘torch::jit::Module::Module(torch::jit::Module&&)’
    409 |       bool return_module) // if true include root itself as the first thing
        |                                                                                                                 ^
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h:93:1: note: declared here
     93 |   Module(Module&&) noexcept = default;
        | ^ ~~~~
  make[2]: *** [CMakeFiles/natten.dir/build.make:1433: CMakeFiles/natten.dir/src/pytorch/cuda/na1d.cu.o] Error 1
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h: In constructor ‘torch::jit::slot_iterator_impl<Policy>::slot_iterator_impl(torch::jit::Module, bool, bool)’:
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h:409:113: error: use of deleted function ‘torch::jit::Module::Module(torch::jit::Module&&)’
    409 |       bool return_module) // if true include root itself as the first thing
        |                                                                                                                 ^
  /home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/torch/include/torch/csrc/jit/api/module.h:93:1: note: declared here
     93 |   Module(Module&&) noexcept = default;
        | ^ ~~~~
  make[2]: *** [CMakeFiles/natten.dir/build.make:1448: CMakeFiles/natten.dir/src/pytorch/cuda/na2d.cu.o] Error 1
  make[1]: *** [CMakeFiles/Makefile2:83: CMakeFiles/natten.dir/all] Error 2
  make: *** [Makefile:91: all] Error 2
  Traceback (most recent call last):
    File "<string>", line 2, in <module>
    File "<pip-setuptools-caller>", line 34, in <module>
    File "/tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/setup.py", line 243, in <module>
      setup(
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/__init__.py", line 104, in setup
      return distutils.core.setup(**attrs)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/core.py", line 184, in setup
      return run_commands(dist)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/core.py", line 200, in run_commands
      dist.run_commands()
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 969, in run_commands
      self.run_command(cmd)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/dist.py", line 967, in run_command
      super().run_command(command)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
      cmd_obj.run()
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/wheel/bdist_wheel.py", line 368, in run
      self.run_command("build")
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/cmd.py", line 316, in run_command
      self.distribution.run_command(command)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/dist.py", line 967, in run_command
      super().run_command(command)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
      cmd_obj.run()
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/command/build.py", line 132, in run
      self.run_command(cmd_name)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/cmd.py", line 316, in run_command
      self.distribution.run_command(command)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/dist.py", line 967, in run_command
      super().run_command(command)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/dist.py", line 988, in run_command
      cmd_obj.run()
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/command/build_ext.py", line 91, in run
      _build_ext.run(self)
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 359, in run
      self.build_extensions()
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 479, in build_extensions
      self._build_extensions_serial()
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/site-packages/setuptools/_distutils/command/build_ext.py", line 505, in _build_extensions_serial
      self.build_extension(ext)
    File "/tmp/pip-install-qicizp_0/natten_a39f7368d78045c489e6297d6b626e00/setup.py", line 230, in build_extension
      subprocess.check_call(["cmake", *cmake_build_args])
    File "/home/justin/anaconda3/envs/stylenat/lib/python3.10/subprocess.py", line 369, in check_call
      raise CalledProcessError(retcode, cmd)
  subprocess.CalledProcessError: Command '['cmake', '--build', 'build/lib.linux-x86_64-cpython-310', '-j', '3']' returned non-zero exit status 2.
  [end of output]

note: This error originates from a subprocess, and is likely not a problem with pip. ERROR: Failed building wheel for natten Running setup.py clean for natten Failed to build natten ERROR: Could not build wheels for natten, which is required to install pyproject.toml-based projects

.. During pip install natten!

alihassanijr commented 1 day ago

Closing due to inactivity. Feel free to reopen if you still have questions.