ddemidov / vexcl

VexCL is a C++ vector expression template library for OpenCL/CUDA/OpenMP
http://vexcl.readthedocs.org
MIT License
699 stars 81 forks source link

sort,fft,simple_cmake_build. test failed #273

Open zsw667291 opened 4 years ago

zsw667291 commented 4 years ago

after cmake -DVEXCL_BUILD_TESTS=ON -DVEXCL_BUILD_EXAMPLES=ON .. & make I run the command: ctest --output-on-failure is shows image

total imfo :

  Start  1: boost_version

1/39 Test #1: boost_version .................... Passed 0.00 sec Start 2: types 2/39 Test #2: types ............................ Passed 0.00 sec Start 3: deduce 3/39 Test #3: deduce ........................... Passed 0.12 sec Start 4: vector_create 4/39 Test #4: vector_create .................... Passed 1.53 sec Start 5: vector_copy 5/39 Test #5: vector_copy ...................... Passed 1.18 sec Start 6: vector_arithmetics 6/39 Test #6: vector_arithmetics ............... Passed 7.28 sec Start 7: vector_view 7/39 Test #7: vector_view ...................... Passed 5.08 sec Start 8: tensordot 8/39 Test #8: tensordot ........................ Passed 0.96 sec Start 9: vector_pointer 9/39 Test #9: vector_pointer ................... Passed 1.02 sec Start 10: tagged_terminal 10/39 Test #10: tagged_terminal .................. Passed 2.06 sec Start 11: temporary 11/39 Test #11: temporary ........................ Passed 2.89 sec Start 12: cast 12/39 Test #12: cast ............................. Passed 1.13 sec Start 13: multivector_create 13/39 Test #13: multivector_create ............... Passed 1.18 sec Start 14: multivector_arithmetics 14/39 Test #14: multivector_arithmetics .......... Passed 5.73 sec Start 15: multi_array 15/39 Test #15: multi_array ...................... Passed 1.67 sec Start 16: spmv 16/39 Test #16: spmv ............................. Passed 4.94 sec Start 17: sparse_matrices 17/39 Test #17: sparse_matrices .................. Passed 4.30 sec Start 18: stencil 18/39 Test #18: stencil .......................... Passed 2.25 sec Start 19: generator 19/39 Test #19: generator ........................ Passed 2.53 sec Start 20: mba 20/39 Test #20: mba .............................. Passed 1.56 sec Start 21: random 21/39 Test #21: random ........................... Passed 3.23 sec Start 22: sort stack smashing detected : /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/build/tests/sort terminated 22/39 Test #22: sort .............................Child aborted***Exception: 4.15 sec seed: 1564113402

  1. Intel(R) Gen9 HD Graphics NEO (Intel(R) OpenCL HD Graphics )
  2. Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz (Intel(R) CPU Runtime for OpenCL(TM) Applications )

Running 6 test cases... unknown location(0): fatal error in "sort_keys_vals_default": memory access violation at address: 0x7fc8c6ba604c: invalid permissions /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/sort.cpp(19): last checkpoint

  Start 23: scan

23/39 Test #23: scan ............................. Passed 2.19 sec Start 24: scan_by_key 24/39 Test #24: scan_by_key ...................... Passed 2.50 sec Start 25: reduce_by_key 25/39 Test #25: reduce_by_key .................... Passed 3.32 sec Start 26: logical 26/39 Test #26: logical .......................... Passed 1.97 sec Start 27: threads 27/39 Test #27: threads .......................... Passed 0.50 sec Start 28: svm 28/39 Test #28: svm .............................. Passed 0.91 sec Start 29: events 29/39 Test #29: events ........................... Passed 1.08 sec Start 30: image 30/39 Test #30: image ............................ Passed 0.49 sec Start 31: custom_kernel 31/39 Test #31: custom_kernel .................... Passed 0.53 sec Start 32: eval 32/39 Test #32: eval ............................. Passed 0.92 sec Start 33: constants 33/39 Test #33: constants ........................ Passed 0.54 sec Start 34: vector_io 34/39 Test #34: vector_io ........................ Passed 0.58 sec Start 35: reinterpret 35/39 Test #35: reinterpret ...................... Passed 0.60 sec Start 36: multiple_objects 36/39 Test #36: multiple_objects ................. Passed 0.11 sec Start 37: context 37/39 Test #37: context .......................... Passed 0.80 sec Start 38: fft 38/39 Test #38: fft ..............................***Failed 589.67 sec seed: 1564113423

  1. Intel(R) Gen9 HD Graphics NEO (Intel(R) OpenCL HD Graphics )
  2. Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz (Intel(R) CPU Runtime for OpenCL(TM) Applications )

Running 4 test cases... /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(51): error in "check_correctness": check std::sqrt(sum(pow(in - back, 2.0)) / N) < 1e-3 failed FFT(C2C) size=8x7x7x7 batch=1 FFT(C2C) size=39x11 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.94395441724135631} exceeds 1e-08 FFT(C2C) size=2 batch=18 FFT(C2C) size=4x2x4x4x3 batch=2 FFT(C2C) size=56 batch=29 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.89860764828448048} exceeds 1e-08 FFT(C2C) size=8x25 batch=1 FFT(C2C) size=4x2x5x1x2 batch=8 FFT(C2C) size=343 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.98066582460705121} exceeds 1e-08 FFT(C2C) size=686 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99787074634192552} exceeds 1e-08 FFT(C2C) size=2x6x2 batch=3 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.72831833395190182} exceeds 1e-08 FFT(C2C) size=11 batch=1 FFT(C2C) size=14 batch=27 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.89533388056748886} exceeds 1e-08 FFT(C2C) size=675 batch=2 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99648159699791061} exceeds 1e-08 FFT(C2C) size=49 batch=38 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.94322191040510139} exceeds 1e-08 FFT(C2C) size=7 batch=37 FFT(C2C) size=22x17 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99994622611138206} exceeds 1e-08 FFT(C2C) size=8x13x7 batch=1 FFT(C2C) size=2 batch=1 FFT(C2C) size=18 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.89155686087742625} exceeds 1e-08 FFT(C2C) size=30 batch=14 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.95200714302571798} exceeds 1e-08 FFT(C2C) size=1750 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99969813396220608} exceeds 1e-08 FFT(C2C) size=2520 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99755639599384982} exceeds 1e-08 FFT(C2C) size=36 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.88285387342337529} exceeds 1e-08 FFT(C2C) size=968 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99354682074628997} exceeds 1e-08 FFT(C2C) size=196 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.98498434970765647} exceeds 1e-08 FFT(C2C) size=36 batch=8 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.92230565103771189} exceeds 1e-08 FFT(C2C) size=8x2x14 batch=16 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.88962469553516799} exceeds 1e-08 FFT(C2C) size=5x10x12 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.95064176340919182} exceeds 1e-08 FFT(C2C) size=1 batch=4 FFT(C2C) size=2744 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99822941381961805} exceeds 1e-08 FFT(C2C) size=363 batch=2 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99373663999093731} exceeds 1e-08 FFT(C2C) size=3 batch=9 FFT(C2C) size=4x2x3x2x3 batch=1 FFT(C2C) size=32 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.92050564289270331} exceeds 1e-08 FFT(C2C) size=2x5x1x4 batch=1 FFT(C2C) size=60x10 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.98970543344186512} exceeds 1e-08 FFT(C2C) size=8x6x7x5 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.74016830891334084} exceeds 1e-08 FFT(C2C) size=2x6x1 batch=43 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.74915990831916046} exceeds 1e-08 FFT(C2C) size=2 batch=1 FFT(C2C) size=3 batch=14 FFT(C2C) size=2178 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.9995879129930918} exceeds 1e-08 FFT(C2C) size=1x6x2x7 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.71508578255285538} exceeds 1e-08 FFT(C2C) size=252 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.98562240255213895} exceeds 1e-08 FFT(C2C) size=600 batch=3 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99108093599288727} exceeds 1e-08 FFT(C2C) size=3360 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99921405578156408} exceeds 1e-08 FFT(C2C) size=1 batch=62 FFT(C2C) size=52 batch=7 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.94026653856749687} exceeds 1e-08 FFT(C2C) size=48x6 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.91813601358549313} exceeds 1e-08 FFT(C2C) size=18x5 batch=18 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.91678963033292238} exceeds 1e-08 FFT(C2C) size=462 batch=4 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99696600317084683} exceeds 1e-08 FFT(C2C) size=52x42 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99783042593481008} exceeds 1e-08 FFT(C2C) size=15x2x8 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.85188468509384574} exceeds 1e-08 FFT(C2C) size=44x22 batch=4 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.9938660523834435} exceeds 1e-08 FFT(C2C) size=559 batch=4 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.99993521045522527} exceeds 1e-08 FFT(C2C) size=4 batch=99 FFT(C2C) size=2x2x3x4x5 batch=1 FFT(C2C) size=90 batch=2 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.977187729833076} exceeds 1e-08 FFT(C2C) size=52 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.93345691017509747} exceeds 1e-08 FFT(C2C) size=624 batch=1 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.98258941068613026} exceeds 1e-08 FFT(C2C) size=314 batch=12 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.9999999761795707} exceeds 1e-08 FFT(C2C) size=166 batch=2 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){1.0000093990256771} exceeds 1e-08 FFT(C2C) size=1 batch=34 FFT(C2C) size=1 batch=1 FFT(C2C) size=3x1x1x1x5 batch=3 FFT(C2C) size=25 batch=6 FFT(C2C) size=3 batch=3 FFT(C2C) size=120 batch=7 /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/tests/fft.cpp(102): error in "test_dimensions": absolute value of rms(back, inp){0.95022524230223626} exceeds 1e-08

*** 45 failures detected in test suite "FastFourierTransform"

  Start 39: simple_cmake_build

39/39 Test #39: simple_cmake_build ...............***Failed 0.46 sec Internal cmake changing into directory: /home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/build/tests/simple Error: cmake execution failed The C compiler identification is GNU 7.4.0 The CXX compiler identification is GNU 7.4.0 Check for working C compiler: /usr/bin/cc Check for working C compiler: /usr/bin/cc -- works Detecting C compiler ABI info Detecting C compiler ABI info - done Detecting C compile features Detecting C compile features - done Check for working CXX compiler: /usr/bin/c++ Check for working CXX compiler: /usr/bin/c++ -- works Detecting CXX compiler ABI info Detecting CXX compiler ABI info - done Detecting CXX compile features Detecting CXX compile features - done Found VexCL::OpenCL Found VexCL::CUDA CMake Error at CMakeLists.txt:6 (vexcl_add_executables): Unknown CMake command "vexcl_add_executables".

Configuring incomplete, errors occurred! See also "/home/SENSETIME/zhushiwei/sense_work/SZ_PROJECTS/ddemidov/vexcl/build/tests/simple/CMakeFiles/CMakeOutput.log".

92% tests passed, 3 tests failed out of 39

Total Test time (real) = 662.04 sec

The following tests FAILED: 22 - sort (Child aborted) 38 - fft (Failed) 39 - simple_cmake_build (Failed) Errors while running CTest

What is the cause of this problem? look forward to your reply

ddemidov commented 4 years ago

Can you try this?

OCL_DEVICE=i7-8700 ctest --output-on-failure

This will exclude the HD Graphics device from the tests (well, in fact it will restrict the tests to the device that has "i7-8700" in the name, but the effect will be the same). I think opencl driver for Intel graphics devices has some limitations.

zsw667291 commented 4 years ago

i use command OCL_DEVICE=i7-8700 ctest --output-on-failure

fix the problem of fft , but the problem of sort & simple_cmake_build still exist image

ddemidov commented 4 years ago

Not sure why do you get the simple_cmake_build failure (could cmake configure go wrong? you could try to clean up the build directory and try to reconfigure), but I think that can be safely ignored.

I can reproduce the segfault error with Intel OpenCL, but not with NVIDIA CUDA or OpenCL. The sort code is ported from ModernGPU code, which targets NVIDIA CUDA, and is quite big, so I am not sure when I will have time to take a closer look at this.

I'll keep the issue open as a reminder for myself.