preda / gpuowl

GPU Mersenne primality test.
GNU General Public License v3.0
163 stars 39 forks source link

ROCm 3.8, GpuOwl requires OpenCL 200, found 120 #192

Closed baryluk closed 4 years ago

baryluk commented 4 years ago

gpuowl at 7e40f5b0af5d42e57c0406ce6b254a67d2af8cbe

Linux, amd64. Compiled using g++ 10.2.0

I am using standard OpenCL loader, and OpenCL headers.

user@debian:~/gpuowl$ dpkg -l | grep opencl
ii  mesa-opencl-icd:amd64                                       20.1.9-1                             amd64        free implementation of the OpenCL API -- ICD runtime
ii  ocl-icd-libopencl1:amd64                                    2.2.12-4                             amd64        Generic OpenCL ICD Loader
ii  ocl-icd-libopencl1:i386                                     2.2.12-4                             i386         Generic OpenCL ICD Loader
ii  ocl-icd-opencl-dev:amd64                                    2.2.12-4                             amd64        OpenCL development files
ii  ocl-icd-opencl-dev:i386                                     2.2.12-4                             i386         OpenCL development files
ii  opencl-c-headers                                            3.0~2020.05.12-g5cc337c-1            all          OpenCL (Open Computing Language) C header files
ii  opencl-clhpp-headers                                        2.2.0~~2.0.11+git9-g0192662-1        all          C++ headers for OpenCL development
ii  opencl-headers                                              3.0~2020.05.12-g5cc337c-1            all          OpenCL (Open Computing Language) header files
ii  rocm-opencl-dev3.8.0                                        2.0.0.293-rocm-rel-3.8-30-c3b7a21    amd64        OpenCL: Open Computing Language on ROCclr
ii  rocm-opencl3.8.0                                            2.0.0.293-rocm-rel-3.8-30-c3b7a21    amd64        OpenCL: Open Computing Language on ROCclr

the ICD is configured to load ROCm 3.8, and it works with many other applications.

clinfo, shows me this (abriviated):

Number of platforms                               1
  Platform Name                                   AMD Accelerated Parallel Processing
  Platform Vendor                                 Advanced Micro Devices, Inc.
  Platform Version                                OpenCL 2.0 AMD-APP (3186.0)
  Platform Profile                                FULL_PROFILE
  Platform Extensions                             cl_khr_icd cl_amd_event_callback 
  Platform Extensions function suffix             AMD

  Platform Name                                   AMD Accelerated Parallel Processing
Number of devices                                 1
  Device Name                                     gfx803
  Device Vendor                                   Advanced Micro Devices, Inc.
  Device Vendor ID                                0x1002
  Device Version                                  OpenCL 1.2 
  Driver Version                                  3186.0 (HSA1.1,LC)
  Device OpenCL C Version                         OpenCL C 2.0 
  Device Type                                     GPU
  Device Board Name (AMD)                         Fiji [Radeon R9 FURY / NANO Series]
  Device Topology (AMD)                           PCI-E, 43:00.0
  Device Profile                                  FULL_PROFILE
  Device Available                                Yes
  Compiler Available                              Yes
  Linker Available                                Yes
user@debian:~/gpuowl$ ldd ./gpuowl
    linux-vdso.so.1 (0x00007ffd5d774000)
    libOpenCL.so.1 => /opt/rocm/lib/libOpenCL.so.1 (0x00007fd21a4fc000)
    libgmp.so.10 => /lib/x86_64-linux-gnu/libgmp.so.10 (0x00007fd21a479000)
    libstdc++.so.6 => /lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007fd21a2ac000)
    libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007fd21a168000)
    libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007fd21a14e000)
    libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007fd21a12c000)
    libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fd219f65000)
    libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007fd219f5f000)
    /lib64/ld-linux-x86-64.so.2 (0x00007fd21a7f0000)

However, trying to run a test on gpuowl gives me this:

$ ./gpuowl -prp 96359411 -time
2020-10-05 17:54:48 gpuowl v7.0-7-g7e40f5b
2020-10-05 17:54:48 Note: not found 'config.txt'
2020-10-05 17:54:48 config: -prp 96359411 -time 
2020-10-05 17:54:48 device 0, unique id ''
2020-10-05 17:54:48 gfx803-0 96359411 FFT: 5M 1K:10:256 (18.38 bpw)
2020-10-05 17:54:48 gfx803-0 96359411 OpenCL args "-DEXP=96359411u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DAMDGPU=1 -DCARRY64=1 -DCARRYM64=1 -DMM_CHAIN=1u -DMM2_CHAIN=1u -DMAX_ACCURACY=1 -DWEIGHT_STEP_MINUS_1=0x1.135f22fff92e3p-1 -DIWEIGHT_STEP_MINUS_1=-0x1.662100950bfe7p-2  -cl-unsafe-math-optimizations -cl-std=CL2.0 -cl-finite-math-only "
2020-10-05 17:54:50 gfx803-0 96359411 /tmp/comgr-c0a9cd/input/CompileSource:50:9: warning: GpuOwl requires OpenCL 200, found 120
#pragma message "GpuOwl requires OpenCL 200, found " STR(__OPENCL_VERSION__)
        ^
1 warning generated.

2020-10-05 17:54:50 gfx803-0 96359411 OpenCL compilation in 2.59 s
2020-10-05 17:54:51 gfx803-0 96359411 Use -maxAlloc to limit GPU memory usage; see -h
2020-10-05 17:54:51 gfx803-0 Exiting because "missing -maxAlloc"
2020-10-05 17:54:51 gfx803-0 Bye
$

Any help would be appreciated.

Here is strace of all opened files:

$ strace -e openat -f ./gpuowl -prp 96359411 -time 2>&1 | grep -v ENOENT | grep -v '/sys'
openat(AT_FDCWD, "/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/opt/rocm/lib/libOpenCL.so.1", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libgmp.so.10", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libstdc++.so.6", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libm.so.6", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libgcc_s.so.1", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libpthread.so.0", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libc.so.6", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libdl.so.2", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "/etc/localtime", O_RDONLY|O_CLOEXEC) = 3
openat(AT_FDCWD, "results.txt", O_WRONLY|O_CREAT|O_APPEND, 0666) = 3
openat(AT_FDCWD, "gpuowl.log", O_WRONLY|O_CREAT|O_APPEND, 0666strace: Process 3185969 attached
) = 4
[pid 3185969] +++ exited with 0 +++
openat(AT_FDCWD, "results.txt", O_WRONLY|O_CREAT|O_APPEND, 0666) = 3
strace: Process 3185970 attached
[pid 3185970] +++ exited with 0 +++
openat(AT_FDCWD, "results.txt", O_WRONLY|O_CREAT|O_APPEND, 0666) = 3
strace: Process 3185971 attached
[pid 3185971] +++ exited with 0 +++
openat(AT_FDCWD, "/etc/OpenCL/vendors/", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
openat(AT_FDCWD, "/etc/OpenCL/vendors/amdocl64_30800.icd", O_RDONLY) = 5
openat(AT_FDCWD, "/opt/rocm/lib/../opencl/lib/libamdocl64.so", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/opt/rocm/lib/../opencl/lib/../../lib/libhsa-runtime64.so.1", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/librt.so.1", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/opt/rocm/lib/../opencl/lib/../../lib/libhsakmt.so.1", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libelf.so.1", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libnuma.so.1", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libz.so.1", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/proc/self/status", O_RDONLY) = 6
openat(AT_FDCWD, "/proc/self/status", O_RDONLY) = 6
openat(AT_FDCWD, "/proc/self/maps", O_RDONLY|O_CLOEXEC) = 6
openat(AT_FDCWD, "/dev/kfd", O_RDWR|O_CLOEXEC) = 6
openat(AT_FDCWD, "/dev/dri/renderD128", O_RDWR|O_CLOEXEC) = 9
openat(AT_FDCWD, "/usr/share/hwdata/pci.ids", O_RDONLY) = 7
openat(AT_FDCWD, "/dev/shm/sem.hsakmt_semaphore", O_RDWR|O_NOFOLLOW) = -1 EACCES (Permission denied)
openat(AT_FDCWD, "/proc/cpuinfo", O_RDONLY) = 7
openat(AT_FDCWD, "/usr/share/hwdata/pci.ids", O_RDONLY) = 7
strace: Process 3185972 attached
[pid 3185968] openat(AT_FDCWD, "/opt/rocm/lib/../opencl/lib/../../lib/libamd_comgr.so.1", O_RDONLY|O_CLOEXEC) = 7
[pid 3185968] openat(AT_FDCWD, "/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 7
[pid 3185968] openat(AT_FDCWD, "/lib/x86_64-linux-gnu/libtinfo.so.5", O_RDONLY|O_CLOEXEC) = 7
[pid 3185968] openat(AT_FDCWD, "/dev/random", O_WRONLY) = 7
[pid 3185968] openat(AT_FDCWD, "/dev/random", O_WRONLY) = 7
[pid 3185968] openat(AT_FDCWD, "/opt/rocm/lib/../opencl/lib/../../lib/libhsa-amd-aqlprofile64.so", O_RDONLY|O_CLOEXEC) = 7
[pid 3185968] openat(AT_FDCWD, "/etc/ld.so.cache", O_RDONLY|O_CLOEXEC) = 7
[pid 3185968] openat(AT_FDCWD, "/dev/urandom", O_RDONLY) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/include/opencl2.0-c.pch", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/input/CompileSource", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185968] openat(AT_FDCWD, "/etc/debian_version", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/include/opencl2.0-c.pch", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/input/CompileSource", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/output/CompileSource-6ec07e38.bc.tmp", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0666) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/include/opencl2.0-c.pch", O_RDONLY|O_CLOEXEC) = 7
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/output/CompileSource.bc", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-50b51e/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-aeedc9", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-aeedc9/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-aeedc9/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-aeedc9/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/input/linked.bc", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185968] openat(AT_FDCWD, "/etc/debian_version", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/input/linked.bc", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/output/linked.bc-75bb00bc.o.tmp", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0666) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/output/linked.bc.o", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-e17199/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/input/linked.bc.o", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185968] openat(AT_FDCWD, "/etc/debian_version", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/input/linked.bc.o", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/output/a.so.tmpc25fafd", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0666) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/output/a.so.tmp9ca2bb9", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0777) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/output/a.so", O_RDONLY|O_CLOEXEC) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/tmp/comgr-5cabb7/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185968] openat(AT_FDCWD, "/dev/random", O_WRONLY) = 3
[pid 3185968] openat(AT_FDCWD, "/dev/random", O_WRONLY) = 3
strace: Process 3185974 attached
[pid 3185974] +++ exited with 0 +++
strace: Process 3185975 attached
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/include/opencl1.2-c.pch", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/input/CompileSource", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185975] openat(AT_FDCWD, "/etc/debian_version", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/include/opencl1.2-c.pch", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/input/CompileSource", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/output/CompileSource-e9e95450.bc.tmp", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0666) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/include/opencl1.2-c.pch", O_RDONLY|O_CLOEXEC) = 7
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/output/CompileSource.bc", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-c4239e/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-f07b2c", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-f07b2c/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-f07b2c/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-f07b2c/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/input/linked.bc", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185975] openat(AT_FDCWD, "/etc/debian_version", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/input/linked.bc", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/output/linked.bc-282d1e13.o.tmp", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0666) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/output/linked.bc.o", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-bdb867/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/input/linked.bc.o", O_WRONLY|O_CREAT|O_TRUNC|O_CLOEXEC, 0666) = 3
[pid 3185975] openat(AT_FDCWD, "/etc/debian_version", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/input/linked.bc.o", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/output/a.so.tmp894e93e", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0666) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/output/a.so.tmp0f1cae7", O_RDWR|O_CREAT|O_EXCL|O_CLOEXEC, 0777) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/output/a.so", O_RDONLY|O_CLOEXEC) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 3
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/include", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/output", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/tmp/comgr-1a1f40/input", O_RDONLY|O_NONBLOCK|O_CLOEXEC|O_DIRECTORY) = 5
[pid 3185975] openat(AT_FDCWD, "/dev/random", O_WRONLY) = 3
[pid 3185975] openat(AT_FDCWD, "/dev/random", O_WRONLY) = 3
[pid 3185975] +++ exited with 0 +++
2020-10-05 17:59:27 gpuowl v7.0-7-g7e40f5b
2020-10-05 17:59:27 Note: not found 'config.txt'
2020-10-05 17:59:27 config: -prp 96359411 -time 
2020-10-05 17:59:27 device 0, unique id ''
2020-10-05 17:59:27 gfx803-0 96359411 FFT: 5M 1K:10:256 (18.38 bpw)
2020-10-05 17:59:27 gfx803-0 96359411 OpenCL args "-DEXP=96359411u -DWIDTH=1024u -DSMALL_HEIGHT=256u -DMIDDLE=10u -DAMDGPU=1 -DCARRY64=1 -DCARRYM64=1 -DMM_CHAIN=1u -DMM2_CHAIN=1u -DMAX_ACCURACY=1 -DWEIGHT_STEP_MINUS_1=0x1.135f22fff92e3p-1 -DIWEIGHT_STEP_MINUS_1=-0x1.662100950bfe7p-2  -cl-unsafe-math-optimizations -cl-std=CL2.0 -cl-finite-math-only "
2020-10-05 17:59:30 gfx803-0 96359411 /tmp/comgr-50b51e/input/CompileSource:50:9: warning: GpuOwl requires OpenCL 200, found 120
#pragma message "GpuOwl requires OpenCL 200, found " STR(__OPENCL_VERSION__)
        ^
1 warning generated.

2020-10-05 17:59:30 gfx803-0 96359411 OpenCL compilation in 2.59 s
2020-10-05 17:59:30 gfx803-0 96359411 Use -maxAlloc to limit GPU memory usage; see -h
2020-10-05 17:59:30 gfx803-0 Exiting because "missing -maxAlloc"
2020-10-05 17:59:30 gfx803-0 Bye
[pid 3185972] +++ exited with 0 +++
+++ exited with 0 +++
$

This is me trying GpuOwl first time ever, so I don't know if maybe my config is wrong or ROCm issue.

preda commented 4 years ago

That is just a #pragma message, not an error, let's ignore it.

Please just pass a -maxAlloc option which is mandatory now, e.g.: -maxAlloc 3G

This limits how much memory the app is allocating on the GPU. Also you can try the v6 branch which is stable https://github.com/preda/gpuowl/tree/v6 If you start using GpuOwl, you may find the discussion on mersenneforum.org/ useful.

preda commented 4 years ago
2020-10-05 17:54:51 gfx803-0 96359411 Use -maxAlloc to limit GPU memory usage; see -h
2020-10-05 17:54:51 gfx803-0 Exiting because "missing -maxAlloc"
2020-10-05 17:54:51 gfx803-0 Bye
baryluk commented 4 years ago

@preda Doh. I am blind. It was just a warning message.

Yes, -maxAlloc 2048M and it works. Yay.

Thanks.