First-order CUDA followup fix: explicit CC list for Jetson and Tegra platforms

griwodz commented 1 month ago

Description

Using CUDA as a language in CMake projects requires selecting the target compute capabilities (CCs) in the variable CMAKE_CUDA_ARCHITECTURES.

The default here is the oldest CC supported by the installed nvcc. For a better, though still incomplete, CC list for discrete GPUs, CMAKE_CUDA_ARCHITECTURES can be set to "all-major". Both fail for Tegra and Jetson platforms because CMake guesses that these are ARM platforms with discrete GPUs.

Therefore, we test whether the file /etc/nv_tegra_release exists. If it does, we assume that the intention is to compile for Tegra or Jetson devices and set CMAKE_CUDA_ARCHITECTURES="53;62;72;87".
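
As a rough illustration of the check described above, here is a minimal sketch rather than the exact code in this PR. The `NOT DEFINED` guard, which lets a user-supplied value win, and the "all-major" fallback branch are assumptions added for the example. The "all-major" value needs CMake 3.23 or newer.

```cmake
# Sketch only: the actual PopSift change may differ in detail.
# Assumption: a value passed by the user (-DCMAKE_CUDA_ARCHITECTURES=...) takes precedence.
if(NOT DEFINED CMAKE_CUDA_ARCHITECTURES)
  if(EXISTS "/etc/nv_tegra_release")
    # The release file is present, so assume we are building on and for a Tegra/Jetson.
    # CC list as given in this PR description; CC 32 is intentionally left out.
    set(CMAKE_CUDA_ARCHITECTURES "53;62;72;87")
  else()
    # Assumed fallback for discrete GPUs: every major CC the installed nvcc supports.
    set(CMAKE_CUDA_ARCHITECTURES "all-major")
  endif()
endif()
```

With a guard like this, cross-compiling for a single board should still be possible by passing, for example, `-DCMAKE_CUDA_ARCHITECTURES=87` on the command line.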

Features list

Select sensible CCs when PopSift is configured on a Tegra/Jetson to compile for a Tegra/Jetson.

Implementation remarks

We are not adding CC 32 to the list since it is deprecated. The current CUDA code may still work on CC 32 platforms.

griwodz commented 1 month ago

Waiting for hhackbarth to confirm that it works on their Jetson version as well. I also learned that CMAKE_CUDA_ARCHITECTURES must be set before project() and not after.
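
To make the ordering remark concrete, a hypothetical top-level CMakeLists.txt skeleton might look as follows. The project name `example` and the minimum CMake version are placeholders and are not taken from PopSift.

```cmake
cmake_minimum_required(VERSION 3.23)  # placeholder version, not PopSift's actual requirement

# Set the variable before project(), so it is already in place when
# CMake enables the CUDA language and probes the compiler.
if(EXISTS "/etc/nv_tegra_release")
  set(CMAKE_CUDA_ARCHITECTURES "53;62;72;87")
endif()

# Hypothetical project name; enabling CUDA here picks up the value set above.
project(example LANGUAGES CXX CUDA)
```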

griwodz commented 1 month ago

Hi @simogasp, the fix is confirmed in bug report #160. I don't know if I'm able to follow up on the Python wrapper, but I'll try. I think that this fix is only beneficial for the develop branch in any case.
