kokkos / kokkos-kernels

Kokkos C++ Performance Portability Programming Ecosystem: Math Kernels - Provides BLAS, Sparse BLAS and Graph Kernels
Other
308 stars 97 forks source link

Nightly test failure, sparse_sptrsv_kokkos_complex_double_int_int_TestDevice with intel debug, and nvhpc/22.3 builds #2282

Closed ndellingwood closed 3 months ago

ndellingwood commented 3 months ago

Nightly tests are failing in in the sparse_sptrsv_kokkos_complex_double_int_int_TestDevice subtest with intel compilers and nvhpc with output:

11:27:53 8: [ RUN      ] openmp.sparse_sptrsv_kokkos_complex_double_int_int_TestDevice
11:27:53 8: /home/jenkins/node/workspace/KokkosKernels_OSRHEL7_Intel19_0_5_cpp17-dbg/kokkos-kernels/sparse/unit_test/Test_Sparse_sptrsv.hpp:193: Failure
11:27:53 8: Value of: lhs.extent(0)
11:27:53 8:   Actual: 5
11:27:53 8: Expected: sum
11:27:53 8: Which is: (0,0)
11:27:53 8: /home/jenkins/node/workspace/KokkosKernels_OSRHEL7_Intel19_0_5_cpp17-dbg/kokkos-kernels/sparse/unit_test/Test_Sparse_sptrsv.hpp:193: Failure
11:27:53 8: Value of: lhs.extent(0)
11:27:53 8:   Actual: 5
11:27:53 8: Expected: sum
11:27:53 8: Which is: (0,0)
11:27:53 8: /home/jenkins/node/workspace/KokkosKernels_OSRHEL7_Intel19_0_5_cpp17-dbg/kokkos-kernels/sparse/unit_test/Test_Sparse_sptrsv.hpp:193: Failure
11:27:53 8: Value of: lhs.extent(0)
11:27:53 8:   Actual: 5
11:27:53 8: Expected: sum
11:27:53 8: Which is: (0,0)
11:27:53 8: /home/jenkins/node/workspace/KokkosKernels_OSRHEL7_Intel19_0_5_cpp17-dbg/kokkos-kernels/sparse/unit_test/Test_Sparse_sptrsv.hpp:193: Failure
11:27:53 8: Value of: lhs.extent(0)
11:27:53 8:   Actual: 5
11:27:53 8: Expected: sum
11:27:53 8: Which is: (0,0)
...

This began following merge of #2280 , @jgfouca can you investigate?

Reproducer (sems-intel-19.0.5):

module load sems-cmake/3.23.1 sems-gcc/8.3.0 sems-intel/19.0.5

%KOKKOSKERNELS_PATH/cm_generate_makefile.bash --with-devices=Serial --arch=SNB --compiler=icpc --cxxflags="-g -Wall -Wunused-parameter -Wshadow -pedantic -Werror -Wsign-compare -Wtype-limits -Wuninitialized -diag-disable=1011 -diag-disable=869 " --cxxstandard="17"  --kokkos-path=$KOKKOS_PATH --kokkoskernels-path=$KOKKOSKERNELS_PATH --with-scalars='double,complex_double' --with-ordinals=int --with-offsets=int,size_t --with-layouts=LayoutLeft --with-tpls=    --with-options=disable_deprecated_code --no-examples --debug --disable-perftests
jgfouca commented 3 months ago

Yeah, I'm seeing similar errors in my block sptrsv PR. I'm wondering how that cleanup PR made it past AT testing.

ndellingwood commented 3 months ago

@jgfouca I don't think the intel/19 CI job tests a debug build?

jgfouca commented 3 months ago

They aren't but I'm seeing the same mishandling of complex data on one of the weaver jobs (but not the other for some reason).