hughperkins / cltorch

An OpenCL backend for torch.
Other
289 stars 26 forks source link

Cltorch Unit test failed #61

Closed HBQ15 closed 8 years ago

HBQ15 commented 8 years ago

I get the following when I ruin the unit test hanbaoquan@HBQM ~/torch/install/lib $ luajit -l cltorch -e 'cltorch.test()' running tests... aftter requiring cltorch.unit_storage Running 2 tests 1/2 test_get ............................................................ [WAIT] 1/2 test_get ............................................................ [ERROR] 2/2 test_basic .......................................................... [WAIT] 2/2 test_basic .......................................................... [ERROR]

Completed 0 asserts in 2 tests with 0 failures and 2 errors and 1 warning

test_get Function call failed Something went wrong: OpenCL library not found at /tmp/luarocks_cltorch-scm-1-7915/cltorch/cltorch/src/torch/generic/Storage.cpp:101 stack traceback: [C]: at 0x7f71e3b571f0 [C]: in function 'ClStorage' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:64: in function 'v' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:84: in function <...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:82> [C]: in function 'xpcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:476: in function '_pcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:436: in function '_run' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:354: in function 'run' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:91: in function 'test' .../hanbaoquan/torch/install/share/lua/5.1/cltorch/Test.lua:8: in function 'test' (command line):1: in main chunk [C]: at 0x00406670


test_basic Function call failed No OpenCL-enabled devices available at /tmp/luarocks_cltorch-scm-1-7915/cltorch/cltorch/src/lib/THClGeneral.cpp:139 stack traceback: [C]: at 0x7f71e3b571f0 [C]: in function 'ClStorage' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:25: in function 'v' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:84: in function <...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:82> [C]: in function 'xpcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:476: in function '_pcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:436: in function '_run' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:354: in function 'run' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:91: in function 'test' .../hanbaoquan/torch/install/share/lua/5.1/cltorch/Test.lua:8: in function 'test' (command line):1: in main chunk [C]: at 0x00406670


Should use TestSuite rather than plain lua table


luajit: .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:362: An error was found while running tests! stack traceback: [C]: in function 'assert' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:362: in function 'run' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:91: in function 'test' .../hanbaoquan/torch/install/share/lua/5.1/cltorch/Test.lua:8: in function 'test' (command line):1: in main chunk [C]: at 0x00406670

I have installed all the requested libraries, clinfo showws: hanbaoquan@HBQM ~/torch/install/lib $ clinfo Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 2.0 AMD-APP (1912.5) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices

Platform Name: AMD Accelerated Parallel Processing Number of devices: 3 Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: AMD Radeon R9 200 Series Device Topology: PCI[ B#1, D#0, F#0 ] Max compute units: 44 Max work items dimensions: 3 Max work items[0]: 256 Max work items[1]: 256 Max work items[2]: 256 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 1000Mhz Address bits: 64 Max memory allocation: 2632946688 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 3867691072 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 2632946688 Max global variable size: 2369651968 Max global variable preferred total size: 3867691072 Max read/write image args: 64 Max on device events: 1024 Queue on device max size: 8388608 Max on device queues: 1 Queue on device preferred size: 262144 SVM capabilities:
Coarse grain buffer: Yes Fine grain buffer: Yes Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities:
Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties:
Out-of-Order: No Profiling : Yes Queue on Device properties:
Out-of-Order: Yes Profiling : Yes Platform ID: 0x7f70034b7a18 Name: Hawaii Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 2.0 Driver version: 1912.5 (VM) Profile: FULL_PROFILE Version: OpenCL 2.0 AMD-APP (1912.5) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_khr_gl_depth_images cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes

Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name:
Device Topology: PCI[ B#2, D#0, F#0 ] Max compute units: 44 Max work items dimensions: 3 Max work items[0]: 256 Max work items[1]: 256 Max work items[2]: 256 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 1050Mhz Address bits: 64 Max memory allocation: 3008913408 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 4251256704 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 3008913408 Max global variable size: 2708022016 Max global variable preferred total size: 4251256704 Max read/write image args: 64 Max on device events: 1024 Queue on device max size: 8388608 Max on device queues: 1 Queue on device preferred size: 262144 SVM capabilities:
Coarse grain buffer: Yes Fine grain buffer: Yes Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities:
Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties:
Out-of-Order: No Profiling : Yes Queue on Device properties:
Out-of-Order: Yes Profiling : Yes Platform ID: 0x7f70034b7a18 Name: Hawaii Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 2.0 Driver version: 1912.5 (VM) Profile: FULL_PROFILE Version: OpenCL 2.0 AMD-APP (1912.5) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_khr_gl_depth_images cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes

Device Type: CL_DEVICE_TYPE_CPU Vendor ID: 1002h Board name:
Max compute units: 8 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 8 Preferred vector width double: 4 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 8 Native vector width double: 4 Max clock frequency: 4200Mhz Address bits: 64 Max memory allocation: 8394331136 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 4096 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 32768 Global memory size: 33577324544 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Global Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 4099363840 Max global variable size: 1879048192 Max global variable preferred total size: 1879048192 Max read/write image args: 64 Max on device events: 0 Queue on device max size: 0 Max on device queues: 0 Queue on device preferred size: 0 SVM capabilities:
Coarse grain buffer: No Fine grain buffer: No Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 1 Error correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities:
Execute OpenCL kernels: Yes Execute native function: Yes Queue on Host properties:
Out-of-Order: No Profiling : Yes Queue on Device properties:
Out-of-Order: No Profiling : No Platform ID: 0x7f70034b7a18 Name: Intel(R) Core(TM) i7-4770K CPU @ 3.50GHz Vendor: GenuineIntel Device OpenCL C version: OpenCL C 1.2 Driver version: 1912.5 (sse2,avx) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1912.5) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_spir cl_khr_gl_event

I'm not sure what I'm missing, please help. I'm using Linux mint 17.3. System specs are:

Intel 4770k AMD R9-290x 16Gb Ram

hughperkins commented 8 years ago

Please update to latest cltorch and try again. If that doesnt work please update to latest torch ,and try again, ie some combinations of:

luarocks install cltorch
luarocks install torch7
HBQ15 commented 8 years ago

Hi Hugh Thanks for the reply, I have run luarocks install cltorch a few times before this, I will try to install torch7 again and see

Thanks

hughperkins commented 8 years ago

Oh wait... it says 'OpenCL library not found', let me re-read a sec...

hughperkins commented 8 years ago

Hmmm, seems you have an OpenCL-enabled GPU available. But seems it cant find the OpenCL library. Can you provide the output of the following commands please ?

ls /etc/OpenCL/vendors
cat /etc/OpenCL/vendors/*
find /usr/lib -name '*opencl*'
HBQ15 commented 8 years ago

Hi Hugh Here are the outputs hanbaoquan@HBQM /etc/OpenCL/vendors $ ls amdocl32.icd amdocl64.icd

hanbaoquan@HBQM /etc/OpenCL/vendors $ ls amdocl32.icd amdocl64.icd

hanbaoquan@HBQM ~ $ find /usr/lib -name 'opencl' /usr/lib/libreoffice/program/libopencllo.so /usr/lib/libreoffice/share/config/soffice.cfg/cui/ui/optopenclpage.ui

Thank you for your help

hughperkins commented 8 years ago

ok, seems like you're missing libopencl.so, but you can do also please:

cat /etc/OpenCL/vendors/amdocl64.icd
cat /etc/OpenCL/vendors/amdocl32.icd
HBQ15 commented 8 years ago

Hi Hugh here's the output hanbaoquan@HBQM ~ $ cat /etc/OpenCL/vendors/amdocl64.icd libamdocl64.so

hanbaoquan@HBQM ~ $ cat /etc/OpenCL/vendors/amdocl32.icd libamdocl32.so

Thank you so much

hughperkins commented 8 years ago

Well.. you seem to be missing libOpenCL.so, but it's strange, because normally clinfo should need libOpenCL.so. For example, on my linux box:

$ ldd $(which clinfo)
    linux-vdso.so.1 =>  (0x00007ffca0398000)
    libOpenCL.so.1 => /usr/lib/x86_64-linux-gnu/libOpenCL.so.1 (0x00007fe160063000)
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fe15fc99000)
    libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007fe15fa94000)
    /lib64/ld-linux-x86-64.so.2 (0x00005651e4125000)

You can see it links with libOpenCL.so, in /usr/lib/ directory

I'm not sure if ldd works on Mac? Can you try doing ldd $(which clinfo), and see if that does anything? otherwise can you try something like otool $(which clinfo) ?

HBQ15 commented 8 years ago

Hi Hugh Here's the output

_hanbaoquan@HBQM ~ $ ldd $(which clinfo) linux-vdso.so.1 => (0x00007ffedc570000) librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x00007f28d5207000) libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f28d4f01000) libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f28d4cfd000) libOpenCL.so.1 => /usr/lib/libOpenCL.so.1 (0x00007f28d4af6000) libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f28d48e0000) libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007f28d46c2000) libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f28d42fd000) /lib64/ld-linux-x86-64.so.2 (0x00007f28d540f000) _

So it seems I do have the libOpenCL.so where it should be, but no idea why it's not being found. I installed the proprietary driver from AMD and the app SDK for openCL, I think everything should be included, I installed cltorch with no issue (as far as I could see). So I'm drawing blanks at the moment.

hughperkins commented 8 years ago

Oh... it's because in clew.c now I have:

#elif defined(__APPLE__)
    const char *path = "/Library/Frameworks/OpenCL.framework/OpenCL";
#else
    const char *path = "libOpenCL.so";
#endif
hughperkins commented 8 years ago

Addressed in 78f8af1 plausibly. Can you try installing the latest version of cltorch, and try again?

HBQ15 commented 8 years ago

Hi Hugh, I just updated to the latest version of cltorch and ran the test again, here's the output:

hanbaoquan@HBQM ~/torch $ luajit -l cltorch -e 'cltorch.test()' running tests... after requiring cltorch.unit_storage Running 2 tests 1/2 test_get ............................................................ [WAIT] 1/2 test_get ............................................................ [ERROR] 2/2 test_basic .......................................................... [WAIT] 2/2 test_basic .......................................................... [ERROR]

Completed 0 asserts in 2 tests with 0 failures and 2 errors

test_get Function call failed Something went wrong: OpenCL library not found at /tmp/luarocks_cltorch-scm-1-3477/cltorch/cltorch/src/torch/generic/Storage.cpp:101 stack traceback: [C]: at 0x7f918e7f71f0 [C]: in function 'ClStorage' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:64: in function 'v' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:84: in function <...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:82> [C]: in function 'xpcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:476: in function '_pcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:436: in function '_run' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:354: in function 'run' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:91: in function 'test' .../hanbaoquan/torch/install/share/lua/5.1/cltorch/Test.lua:8: in function 'test' (command line):1: in main chunk [C]: at 0x00406670


test_basic Function call failed No OpenCL-enabled devices available at /tmp/luarocks_cltorch-scm-1-3477/cltorch/cltorch/src/lib/THClGeneral.cpp:139 stack traceback: [C]: at 0x7f918e7f71f0 [C]: in function 'ClStorage' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:25: in function 'v' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:84: in function <...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:82> [C]: in function 'xpcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:476: in function '_pcall' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:436: in function '_run' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:354: in function 'run' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:91: in function 'test' .../hanbaoquan/torch/install/share/lua/5.1/cltorch/Test.lua:8: in function 'test' (command line):1: in main chunk [C]: at 0x00406670


luajit: .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:362: An error was found while running tests! stack traceback: [C]: in function 'assert' .../hanbaoquan/torch/install/share/lua/5.1/torch/Tester.lua:362: in function 'run' ...uan/torch/install/share/lua/5.1/cltorch/unit_storage.lua:91: in function 'test' .../hanbaoquan/torch/install/share/lua/5.1/cltorch/Test.lua:8: in function 'test' (command line):1: in main chunk [C]: at 0x00406670

hughperkins commented 8 years ago

Added some additional paths in https://github.com/hughperkins/clew/commit/ea11370417c1935e49f1f90e45089473334db824 Can you install latest cltorch, and try again please?

HBQ15 commented 8 years ago

Hi Hugh All tests now passed no errors, here's the output. Thank you sooo much!!

hanbaoquan@HBQM ~/torch $ luajit -l cltorch -e 'cltorch.test()' running tests... after requiring cltorch.unit_storage Running 2 tests 1/2 test_get ............................................................ [WAIT] Using Advanced Micro Devices, Inc. , OpenCL platform: AMD Accelerated Parallel Processing Using OpenCL device: Hawaii 1/2 test_get ............................................................ [PASS] 2/2 test_basic .......................................................... [WAIT] 2/2 test_basic .......................................................... [PASS] Completed 15 asserts in 2 tests with 0 failures and 0 errors

tester.errors 0

res true aftter requiring cltorch.unit_tensor Running 116 tests 1/116 outplace_div .................................................... [WAIT]

1/116 outplace_div .................................................... [PASS] 2/116 test_addcmul .................................................... [WAIT] 2/116 test_addcmul .................................................... [PASS] 3/116 outplace_tanh ................................................... [WAIT] 3/116 outplace_tanh ................................................... [PASS] 4/116 outplace_pow .................................................... [WAIT]

4/116 outplace_pow .................................................... [PASS] 5/116 inplace_tanh .................................................... [WAIT] 5/116 inplace_tanh .................................................... [PASS] 6/116 test_scatterFill ................................................ [WAIT] 6/116 test_scatterFill ................................................ [PASS] 7/116 test_prodall .................................................... [WAIT] THClReduceAll.cl build log: "/tmp/OCL19170T9.cl", line 9: warning: variable "in1" was declared but never referenced float *in1 = &_in1; ^

"/tmp/OCL19170T9.cl", line 10: warning: variable "out" was declared but never referenced float *out = &_out; ^

7/116 test_prodall .................................................... [PASS] 8/116 outplace_cpow ................................................... [WAIT] 8/116 outplace_cpow ................................................... [PASS] 9/116 inplace_atan .................................................... [WAIT] 9/116 inplace_atan .................................................... [PASS] 10/116 inplace_le ...................................................... [WAIT] 10/116 inplace_le ...................................................... [PASS] 11/116 test_equals ..................................................... [WAIT] 11/116 test_equals ..................................................... [PASS] 12/116 self_lt ......................................................... [WAIT]

12/116 self_lt ......................................................... [PASS] 13/116 test_prod ....................................................... [WAIT] 13/116 test_prod ....................................................... [PASS] 14/116 test_matrixwide ................................................. [WAIT] 14/116 test_matrixwide ................................................. [PASS] 15/116 inplace_sqrt .................................................... [WAIT] 15/116 inplace_sqrt .................................................... [PASS] 16/116 inplace_ceil .................................................... [WAIT] 16/116 inplace_ceil .................................................... [PASS] 17/116 test_scatter .................................................... [WAIT] 17/116 test_scatter .................................................... [PASS] 18/116 test_map ........................................................ [WAIT] 18/116 test_map ........................................................ [PASS] 19/116 outplace_sin .................................................... [WAIT] 19/116 outplace_sin .................................................... [PASS] 20/116 outplace_ge ..................................................... [WAIT]

20/116 outplace_ge ..................................................... [PASS] 21/116 outplace_add .................................................... [WAIT]

21/116 outplace_add .................................................... [PASS] 22/116 outplace_ne ..................................................... [WAIT]

22/116 outplace_ne ..................................................... [PASS] 23/116 test_get ........................................................ [WAIT] 23/116 test_get ........................................................ [PASS] 24/116 outplace_cdiv ................................................... [WAIT] 24/116 outplace_cdiv ................................................... [PASS] 25/116 inplace_log ..................................................... [WAIT] 25/116 inplace_log ..................................................... [PASS] 26/116 inplace_acos .................................................... [WAIT] 26/116 inplace_acos .................................................... [PASS] 27/116 outplace_mul .................................................... [WAIT]

27/116 outplace_mul .................................................... [PASS] 28/116 test_intpower ................................................... [WAIT] 28/116 test_intpower ................................................... [PASS] 29/116 inplace_cinv .................................................... [WAIT] 29/116 inplace_cinv .................................................... [PASS] 30/116 test_reduceAll .................................................. [WAIT] THClReduceAll.cl build log: "/tmp/OCL19170T39.cl", line 9: warning: variable "in1" was declared but never referenced float *in1 = &_in1; ^

"/tmp/OCL19170T39.cl", line 10: warning: variable "out" was declared but never referenced float *out = &_out; ^

30/116 test_reduceAll .................................................. [PASS] 31/116 test_basic ...................................................... [WAIT] 31/116 test_basic ...................................................... [PASS] 32/116 test_min1 ....................................................... [WAIT] 32/116 test_min1 ....................................................... [PASS] 33/116 operator_div_scalar ............................................. [WAIT]

33/116 operator_div_scalar ............................................. [PASS] 34/116 self_eq ......................................................... [WAIT]

34/116 self_eq ......................................................... [PASS] 35/116 test_mean ....................................................... [WAIT] 35/116 test_mean ....................................................... [PASS] 36/116 inplace_cos ..................................................... [WAIT] 36/116 inplace_cos ..................................................... [PASS] 37/116 outplace_log .................................................... [WAIT] 37/116 outplace_log .................................................... [PASS] 38/116 operator_plus ................................................... [WAIT] 38/116 operator_plus ................................................... [PASS] 39/116 outplace_eq ..................................................... [WAIT]

39/116 outplace_eq ..................................................... [PASS] 40/116 outplace_gt ..................................................... [WAIT]

40/116 outplace_gt ..................................................... [PASS] 41/116 inplace_exp ..................................................... [WAIT] 41/116 inplace_exp ..................................................... [PASS] 42/116 test_gather_t ................................................... [WAIT] 42/116 test_gather_t ................................................... [PASS] 43/116 outplace_asin ................................................... [WAIT] 43/116 outplace_asin ................................................... [PASS] 44/116 operator_sub_scalar ............................................. [WAIT]

44/116 operator_sub_scalar ............................................. [PASS] 45/116 outplace_lt ..................................................... [WAIT]

45/116 outplace_lt ..................................................... [PASS] 46/116 test_map2 ....................................................... [WAIT] 46/116 test_map2 ....................................................... [PASS] 47/116 operator_plus_scalar ............................................ [WAIT]

47/116 operator_plus_scalar ............................................ [PASS] 48/116 test_addcdivshape ............................................... [WAIT] 48/116 test_addcdivshape ............................................... [PASS] 49/116 inplace_sin ..................................................... [WAIT] 49/116 inplace_sin ..................................................... [PASS] 50/116 test_sum_t ...................................................... [WAIT] 50/116 test_sum_t ...................................................... [PASS] 51/116 inplace_atan2 ................................................... [WAIT] 51/116 inplace_atan2 ................................................... [PASS] 52/116 test_norm ....................................................... [WAIT] 52/116 test_norm ....................................................... [PASS] 53/116 self_ge ......................................................... [WAIT]

53/116 self_ge ......................................................... [PASS] 54/116 operator_mul_scalar ............................................. [WAIT]

54/116 operator_mul_scalar ............................................. [PASS] 55/116 outplace_sigmoid ................................................ [WAIT] 55/116 outplace_sigmoid ................................................ [PASS] 56/116 self_ne ......................................................... [WAIT]

56/116 self_ne ......................................................... [PASS] 57/116 outplace_sign ................................................... [WAIT] 57/116 outplace_sign ................................................... [PASS] 58/116 test_gather_narrowed ............................................ [WAIT] new wrapper, size 4 new wrapper, size 4 58/116 test_gather_narrowed ............................................ [PASS] 59/116 test_neg ........................................................ [WAIT] 59/116 test_neg ........................................................ [PASS] 60/116 test_indexfill .................................................. [WAIT] 60/116 test_indexfill .................................................. [PASS] 61/116 test_gather ..................................................... [WAIT] 61/116 test_gather ..................................................... [PASS] 62/116 test_sum ........................................................ [WAIT] 62/116 test_sum ........................................................ [PASS] 63/116 test_apply ...................................................... [WAIT] 63/116 test_apply ...................................................... [PASS] 64/116 test_cmin ....................................................... [WAIT] 64/116 test_cmin ....................................................... [PASS] 65/116 test_perelement ................................................. [WAIT] 65/116 test_perelement ................................................. [PASS] 66/116 test_min2 ....................................................... [WAIT] 66/116 test_min2 ....................................................... [PASS] 67/116 test_max2 ....................................................... [WAIT] 67/116 test_max2 ....................................................... [PASS] 68/116 test_cumprod .................................................... [WAIT] 68/116 test_cumprod .................................................... [PASS] 69/116 outplace_cos .................................................... [WAIT] 69/116 outplace_cos .................................................... [PASS] 70/116 inplace_ge ...................................................... [WAIT] 70/116 inplace_ge ...................................................... [PASS] 71/116 test_indexselect ................................................ [WAIT] 71/116 test_indexselect ................................................ [PASS] 72/116 inplace_add ..................................................... [WAIT] 72/116 inplace_add ..................................................... [PASS] 73/116 inplace_gt ...................................................... [WAIT] 73/116 inplace_gt ...................................................... [PASS] 74/116 test_max1 ....................................................... [WAIT] 74/116 test_max1 ....................................................... [PASS] 75/116 test_reshape .................................................... [WAIT] 75/116 test_reshape .................................................... [PASS] 76/116 test_cmul ....................................................... [WAIT] 76/116 test_cmul ....................................................... [PASS] 77/116 test_powerofneg ................................................. [WAIT] 77/116 test_powerofneg ................................................. [PASS] 78/116 test_blas ....................................................... [WAIT] 78/116 test_blas ....................................................... [PASS] 79/116 self_gt ......................................................... [WAIT]

79/116 self_gt ......................................................... [PASS] 80/116 inplace_floor ................................................... [WAIT] 80/116 inplace_floor ................................................... [PASS] 81/116 outplace_acos ................................................... [WAIT] 81/116 outplace_acos ................................................... [PASS] 82/116 inplace_sign .................................................... [WAIT] 82/116 inplace_sign .................................................... [PASS] 83/116 inplace_asin .................................................... [WAIT] 83/116 inplace_asin .................................................... [PASS] 84/116 outplace_ceil ................................................... [WAIT] 84/116 outplace_ceil ................................................... [PASS] 85/116 operator_sub .................................................... [WAIT] 85/116 operator_sub .................................................... [PASS] 86/116 outplace_abs .................................................... [WAIT] 86/116 outplace_abs .................................................... [PASS] 87/116 test_indexcopy .................................................. [WAIT] 87/116 test_indexcopy .................................................. [PASS] 88/116 test_sub ........................................................ [WAIT] 88/116 test_sub ........................................................ [PASS] 89/116 outplace_round .................................................. [WAIT] 89/116 outplace_round .................................................. [PASS] 90/116 test_cumsum ..................................................... [WAIT] 90/116 test_cumsum ..................................................... [PASS] 91/116 inplace_abs ..................................................... [WAIT] 91/116 inplace_abs ..................................................... [PASS] 92/116 outplace_le ..................................................... [WAIT]

92/116 outplace_le ..................................................... [PASS] 93/116 test_clone ...................................................... [WAIT] 93/116 test_clone ...................................................... [PASS] 94/116 test_fills ...................................................... [WAIT] 94/116 test_fills ...................................................... [PASS] 95/116 test_addcdiv .................................................... [WAIT] 95/116 test_addcdiv .................................................... [PASS] 96/116 inplace_cpow .................................................... [WAIT] 96/116 inplace_cpow .................................................... [PASS] 97/116 outplace_exp .................................................... [WAIT] 97/116 outplace_exp .................................................... [PASS] 98/116 outplace_floor .................................................. [WAIT] 98/116 outplace_floor .................................................. [PASS] 99/116 inplace_eq ...................................................... [WAIT] 99/116 inplace_eq ...................................................... [PASS] 100/116 outplace_sqrt ................................................... [WAIT] 100/116 outplace_sqrt ................................................... [PASS] 101/116 outplace_cinv ................................................... [WAIT] 101/116 outplace_cinv ................................................... [PASS] 102/116 test_sumallt .................................................... [WAIT] 102/116 test_sumallt .................................................... [PASS] 103/116 inplace_round ................................................... [WAIT] 103/116 inplace_round ................................................... [PASS] 104/116 inplace_cdiv .................................................... [WAIT] 104/116 inplace_cdiv .................................................... [PASS] 105/116 test_sum_t_offset ............................................... [WAIT] 105/116 test_sum_t_offset ............................................... [PASS] 106/116 outplace_atan ................................................... [WAIT] 106/116 outplace_atan ................................................... [PASS] 107/116 test_add ........................................................ [WAIT] 107/116 test_add ........................................................ [PASS] 108/116 test_sumall ..................................................... [WAIT] 108/116 test_sumall ..................................................... [PASS] 109/116 inplace_cmul .................................................... [WAIT] 109/116 inplace_cmul .................................................... [PASS] 110/116 inplace_lt ...................................................... [WAIT] 110/116 inplace_lt ...................................................... [PASS] 111/116 outplace_cmul ................................................... [WAIT] 111/116 outplace_cmul ................................................... [PASS] 112/116 inplace_ne ...................................................... [WAIT] 112/116 inplace_ne ...................................................... [PASS] 113/116 inplace_sigmoid ................................................. [WAIT] 113/116 inplace_sigmoid ................................................. [PASS] 114/116 self_le ......................................................... [WAIT]

114/116 self_le ......................................................... [PASS] 115/116 test_meanall .................................................... [WAIT] 115/116 test_meanall .................................................... [PASS] 116/116 test_save ....................................................... [WAIT] 116/116 test_save ....................................................... [PASS] Completed 190 asserts in 116 tests with 0 failures and 0 errors all tests finished

hughperkins commented 8 years ago

Cool! :-)

rsaliy commented 7 years ago

Hi, I've got the same issue with cltorch.test() ldd $(which clinfo) shows: .......... ldd $(which clinfo)

linux-gate.so.1 =>  (0xb776a000)
librt.so.1 => /lib/i386-linux-gnu/librt.so.1 (0xb7741000)
libm.so.6 => /lib/i386-linux-gnu/libm.so.6 (0xb76ec000)
libdl.so.2 => /lib/i386-linux-gnu/libdl.so.2 (0xb76e6000)
libOpenCL.so.1 => not found
libgcc_s.so.1 => /lib/i386-linux-gnu/libgcc_s.so.1 (0xb76c9000)
libpthread.so.0 => /lib/i386-linux-gnu/libpthread.so.0 (0xb76ac000)
libc.so.6 => /lib/i386-linux-gnu/libc.so.6 (0xb74f6000)
/lib/ld-linux.so.2 (0x80076000)

............................................................ locate libOpenCL.so shows: .......... locate libOpenCL.so

/opt/AMDAPPSDK-3.0/lib/x86/libOpenCL.so
/opt/AMDAPPSDK-3.0/lib/x86/libOpenCL.so.1
/opt/AMDAPPSDK-3.0/lib/x86_64/libOpenCL.so
/opt/AMDAPPSDK-3.0/lib/x86_64/sdk/libOpenCL.so
/opt/AMDAPPSDK-3.0/lib/x86_64/sdk/libOpenCL.so.1
/opt/amd-opencl-icd-VERSION.MINOR/lib/libOpenCL.so.1
hughperkins commented 7 years ago

Hi, I'm taking a break from OpenCL for a bit, since NIPS, trying some new things. Your options therefore are:

On the whole, I think the second option will work better, but it depends. I will mark the issue as 'help wanted'.

hughperkins commented 7 years ago

(Note: you might want to open a new issue for this, so people see it. Maybe someone can help you to fix it?)

rsaliy commented 7 years ago

@hughperkins thanks, done https://github.com/hughperkins/distro-cl/issues/17