Closed zhichen3 closed 2 months ago
I'm getting CUDA Exception: Lane User Stack Overflow
when evaluating *(d_num_failed.copyToHost())
line 814 in castro_react.cpp in cuda-gdb
when I link, I get this message:
Stack size for entry function '_ZN5amrex13launch_globalILi256EZNS_6launchILi256EZNS_9ReduceOpsIJNS_11ReduceOpMinEEE4evalINS_10ReduceDataIJNS_10ValLocPairIdNS_7IntVectEEEEEEZNS4_4evalINS_8FabArrayINS_9FArrayBoxEEESA_ZN6Castro13estdt_burningEiEUliiiiE_EENSt9enable_ifIXsr5amrex10IsFabArrayIT_vEE5valueEvE4typeERKSI_RKS8_RT0_OT1_EUliiiE_EEvRKNS_3BoxERSI_RKSP_EUlvE_EEvimP11CUstream_stRKT0_EUlvE_EEvS13_' cannot be statically determined
so the compiler is telling us there is something up in that function
I'm able to reproduce this on my workstation with inputs.N14.coarse
(I don't have enough memory for the others).
fixed by eliminating recursion
I'm getting cuda errors on the very first step of the subchandra problem.
CUDA error 700 in file /home/zhi/github/amrex/Src/Base/AMReX_GpuDevice.cpp line 614: an illegal memory access was encountered
To reproduce, compile subchandra with
make -f GNUmakefile.nse_net USE_CUDA=TRUE USE_SIMPLIFIED_SDC=TRUE NETWORK_DIR=subch_base
With Backtrace: