rapidsai / raft

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
https://docs.rapids.ai/api/raft/stable/
Apache License 2.0
787 stars 195 forks source link

Raft Cpp Test Error #2476

Open kun429973 opened 1 month ago

kun429973 commented 1 month ago

raft: branch-22.12 Tesla V100S-PCIE-32GB

When I run test/sparse/dist_coo_spmv.cu, I get the following error:

unknown file: Failure C++ exception with description "CUDA error encountered at: file=/home/lyk/raft/cpp/include/raft/sparse/convert/detail/coo.cuh line=74: call='cudaGetLastError()', Reason=cudaErrorLaunchOutOfResources:too many resources requested for launch Obtained 17 stack frames

0 in ./SPARSE_DIST_TEST(+0x14964) [0x55a7803fd964]

1 in ./SPARSE_DIST_TEST(+0x148f6) [0x55a7803fd8f6]

2 in ./SPARSE_DIST_TEST(+0x178e6) [0x55a7804008e6]

3 in ./SPARSE_DIST_TEST(+0x286a4) [0x55a7804116a4]

4 in ./SPARSE_DIST_TEST(+0x23ba1) [0x55a78040cba1]

5 in ./SPARSE_DIST_TEST(+0x20bae) [0x55a780409bae]

6 in ./SPARSE_DIST_TEST(+0x1cc00) [0x55a780405c00]

7 in ./SPARSE_DIST_TEST(+0x4b7ae) [0x55a7804347ae]

8 in /opt/conda/envs/rapids/lib/libgtest.so.1.13.0(_ZN7testing8internal35HandleExceptionsInMethodIfSupportedINS_4TestEvEET0_PT_MS4_FS3_vEPKc+0x4e) [0x7f7c8de9db5e]

9 in /opt/conda/envs/rapids/lib/libgtest.so.1.13.0(_ZN7testing4Test3RunEv+0x64) [0x7f7c8de9ddb4]

10 in /opt/conda/envs/rapids/lib/libgtest.so.1.13.0(_ZN7testing8TestInfo3RunEv+0x19f) [0x7f7c8de9e1ff]

11 in /opt/conda/envs/rapids/lib/libgtest.so.1.13.0(_ZN7testing9TestSuite3RunEv+0xf4) [0x7f7c8de9e554]

12 in /opt/conda/envs/rapids/lib/libgtest.so.1.13.0(_ZN7testing8internal12UnitTestImpl11RunAllTestsEv+0x3bb) [0x7f7c8dea6fdb]

13 in /opt/conda/envs/rapids/lib/libgtest.so.1.13.0(_ZN7testing8UnitTest3RunEv+0x86) [0x7f7c8de9e766]

14 in /opt/conda/envs/rapids/lib/libgtest_main.so.1.13.0(main+0x3f) [0x7f7c8de4006f]

15 in /usr/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3) [0x7f7c67a79083]

16 in ./SPARSE_DIST_TEST(+0x729e) [0x55a7803f029e]

" thrown in SetUp(). [ FAILED ] SparseDistanceCOOSPMVTests/SparseDistanceCOOSPMVTestHashStrategyF.Result/0, where GetParam() = (120 ms) [ RUN ] SparseDistanceCOOSPMVTests/SparseDistanceCOOSPMVTestHashStrategyF.Result/1