RX 6600 XT - Githubissues

RomanKondratovich commented 2 years ago

Hello, how about RX 6600 XT?

error log:

7.16.20 (unknown error) - exit code 3221225477 (0xc0000005) Platform name: AMD Accelerated Parallel Processing Platform version: OpenCL 2.1 AMD-APP (3354.13) Device: gfx1032 Device version: OpenCL 2.0 AMD-APP (3354.13) Device memory: 8176MB OPTIONS: -cl-std=CL2.0 -DBOARDSIZE=27 -DGPU_DEPTH=11 -DWORKSPACE_SIZE=20971520 -DWORKGROUP_SIZE=64 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00007ff97d4c1f4c read attempt to address 0x00000000 Engaging BOINC Windows Runtime Debugger... ******************** BOINC Windows Runtime Debugger Version 7.16.14 Dump Timestamp : 02/17/22 03:55:59 Install Directory : C:\Program Files\BOINC\ Data Directory : C:\ProgramData\BOINC Project Symstore : LoadLibraryA( C:\ProgramData\BOINC\dbghelp.dll ): GetLastError = 126 Loaded Library : dbghelp.dll LoadLibraryA( C:\ProgramData\BOINC\symsrv.dll ): GetLastError = 126 LoadLibraryA( symsrv.dll ): GetLastError = 126 LoadLibraryA( C:\ProgramData\BOINC\srcsrv.dll ): GetLastError = 126 LoadLibraryA( srcsrv.dll ): GetLastError = 126 LoadLibraryA( C:\ProgramData\BOINC\version.dll ): GetLastError = 126 Loaded Library : version.dll Debugger Engine : 4.0.5.0 Symbol Search Path: C:\ProgramData\BOINC\slots\0;C:\ProgramData\BOINC\projects\www.rechenkraft.net_yoyo Get Product Name Failed. ModLoad: 0000000040000000 000000000019d000 C:\ProgramData\BOINC\projects\www.rechenkraft.net_yoyo\m-queens2-boinc-ocl.exe (1.0.0.0) (-nosymbols- Symbols Loaded) File Version : 1, 0, 0, 0 Company Name : MingW-W64 Project. All rights reserved. Product Name : Product Version : 1, 0, 0, 0 ModLoad: 00000000d1070000 00000000001f5000 C:\Windows\SYSTEM32\ntdll.dll (10.0.19041.1466) (-exported- Symbols Loaded) File Version : 10.0.19041.1466 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1466 ModLoad: 00000000d0470000 00000000000be000 C:\Windows\System32\KERNEL32.DLL (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.1503 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1503 ModLoad: 00000000ce7b0000 00000000002c8000 C:\Windows\System32\KERNELBASE.dll (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.1503 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1503 ModLoad: 00000000cfb60000 00000000000ae000 C:\Windows\System32\ADVAPI32.dll (10.0.19041.1466) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000d0e60000 000000000009e000 C:\Windows\System32\msvcrt.dll (7.0.19041.546) (-exported- Symbols Loaded) File Version : 7.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 7.0.19041.546 ModLoad: 00000000d09e0000 000000000009c000 C:\Windows\System32\sechost.dll (10.0.19041.1466) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000d0f00000 0000000000125000 C:\Windows\System32\RPCRT4.dll (10.0.19041.1466) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000cf0c0000 0000000000744000 C:\Windows\System32\SHELL32.dll (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.964 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.964 ModLoad: 00000000ceb20000 000000000009d000 C:\Windows\System32\msvcp_win.dll (10.0.19041.789) (-exported- Symbols Loaded) File Version : 10.0.19041.789 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.789 ModLoad: 00000000cee00000 0000000000100000 C:\Windows\System32\ucrtbase.dll (10.0.19041.789) (-exported- Symbols Loaded) File Version : 10.0.19041.789 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.789 ModLoad: 00000000d0590000 00000000001a0000 C:\Windows\System32\USER32.dll (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000ce780000 0000000000022000 C:\Windows\System32\win32u.dll (10.0.19041.1526) (-exported- Symbols Loaded) File Version : 10.0.19041.1526 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1526 ModLoad: 00000000cfb30000 000000000002b000 C:\Windows\System32\GDI32.dll (10.0.19041.1202) (-exported- Symbols Loaded) File Version : 10.0.19041.1202 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1202 ModLoad: 00000000cefb0000 000000000010d000 C:\Windows\System32\gdi32full.dll (10.0.19041.1466) (-exported- Symbols Loaded) File Version : 10.0.19041.1466 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1466 ModLoad: 0000000061d00000 0000000000012000 C:\ProgramData\BOINC\projects\www.rechenkraft.net_yoyo\libOpenCL.dll (2.2.6.0) (-exported- Symbols Loaded) File Version : 2.2.6.0 Company Name : Khronos Group Product Name : Khronos OpenCL ICD Loader Product Version : ModLoad: 00000000ced20000 000000000004e000 C:\Windows\System32\CFGMGR32.dll (10.0.19041.1151) (-exported- Symbols Loaded) File Version : 10.0.19041.1151 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1151 ModLoad: 00000000cfa00000 000000000012a000 C:\Windows\System32\ole32.dll (10.0.19041.1202) (-exported- Symbols Loaded) File Version : 10.0.19041.1320 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1320 ModLoad: 00000000d0a80000 0000000000355000 C:\Windows\System32\combase.dll (10.0.19041.1348) (-exported- Symbols Loaded) File Version : 10.0.19041.1320 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1320 ModLoad: 00000000d0900000 0000000000030000 C:\Windows\System32\IMM32.DLL (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 000000007fac0000 00000000040fa000 C:\Windows\System32\DriverStore\FileRepository\u0376255.inf_amd64_b67dbc7531b4ea7c\B376137\amdocl64.dll (10.0.3354.13) (-exported- Symbols Loaded) File Version : 10.0.3354.13 Company Name : Advanced Micro Devices Inc. Product Name : OpenCL 2.0 AMD-APP (3354.13) Product Version : 10.0.3354.13 ModLoad: 00000000b5a80000 0000000000125000 C:\Windows\SYSTEM32\OPENGL32.dll (10.0.19041.1081) (-exported- Symbols Loaded) File Version : 10.0.19041.1081 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1081 ModLoad: 00000000b6a20000 000000000002c000 C:\Windows\SYSTEM32\GLU32.dll (10.0.19041.1081) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000cff20000 0000000000472000 C:\Windows\System32\SETUPAPI.dll (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000cea80000 0000000000027000 C:\Windows\System32\bcrypt.dll (10.0.19041.1023) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000d0530000 0000000000055000 C:\Windows\System32\SHLWAPI.dll (10.0.19041.1023) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000cc490000 000000000002f000 C:\Windows\SYSTEM32\dwmapi.dll (10.0.19041.746) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000ce4b0000 0000000000034000 C:\Windows\SYSTEM32\DEVOBJ.dll (10.0.19041.1151) (-exported- Symbols Loaded) File Version : 10.0.19041.1151 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1151 ModLoad: 00000000ceab0000 0000000000069000 C:\Windows\System32\WINTRUST.dll (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.1503 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1503 ModLoad: 00000000cebc0000 0000000000156000 C:\Windows\System32\CRYPT32.dll (10.0.19041.1320) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000ce2f0000 0000000000012000 C:\Windows\SYSTEM32\MSASN1.dll (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 000000007a5b0000 000000000550b000 C:\Windows\System32\DriverStore\FileRepository\u0376255.inf_amd64_b67dbc7531b4ea7c\B376137\amd_comgr.dll (-exported- Symbols Loaded) ModLoad: 00000000d03a0000 00000000000cd000 C:\Windows\System32\OLEAUT32.dll (10.0.19041.985) (-exported- Symbols Loaded) File Version : 10.0.19041.985 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.985 ModLoad: 00000000c55c0000 000000000000a000 C:\Windows\SYSTEM32\VERSION.dll (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 00000000b6a50000 0000000000067000 C:\Windows\SYSTEM32\IntelOpenCL64.dll (20.19.15.4531) (-exported- Symbols Loaded) File Version : 20.19.15.4531 Company Name : Intel Corporation Product Name : Intel(R) OpenCL(TM) SDK Product Version : 20.19.15.4531 ModLoad: 00000000a9030000 00000000001c2000 C:\Program Files (x86)\Common Files\Intel\OpenCL\bin\x64\intelocl64.dll (5.2.0.10094) (-exported- Symbols Loaded) File Version : 5.2.0.10094 Company Name : Intel Corporation Product Name : Intel(R) SDK for OpenCL* Applications Product Version : 5.2.0.10094 ModLoad: 00000000aa840000 000000000009d000 C:\Program Files (x86)\Common Files\Intel\OpenCL\bin\x64\task_executor64.dll (5.2.0.10094) (-exported- Symbols Loaded) File Version : 5.2.0.10094 Company Name : Intel Corporation Product Name : Intel(R) OpenCL(TM) SDK Product Version : 5.2.0.10094 ModLoad: 000000009ab60000 00000000000d8000 C:\Program Files (x86)\Common Files\Intel\OpenCL\bin\x64\cpu_device64.dll (5.2.0.10094) (-exported- Symbols Loaded) File Version : 5.2.0.10094 Company Name : Intel Corporation Product Name : Intel(R) OpenCL(TM) SDK Product Version : 5.2.0.10094 ModLoad: 000000008cc50000 00000000004c1000 C:\Windows\SYSTEM32\igdrcl64.dll (20.19.15.4531) (-exported- Symbols Loaded) File Version : 20.19.15.4531 Company Name : Intel Corporation Product Name : Intel HD Graphics Drivers for Windows(R) Product Version : 20.19.15.4531 ModLoad: 00000000cd2d0000 00000000000f4000 C:\Windows\SYSTEM32\dxgi.dll (10.0.19041.1266) (-exported- Symbols Loaded) File Version : 10.0.19041.1266 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1266 ModLoad: 00000000cd020000 0000000000012000 C:\Windows\SYSTEM32\kernel.appcore.dll (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 00000000cc780000 0000000000794000 C:\Windows\SYSTEM32\windows.storage.dll (10.0.19041.1503) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000ce100000 000000000002e000 C:\Windows\SYSTEM32\Wldp.dll (10.0.19041.1320) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000cf820000 00000000000ad000 C:\Windows\System32\SHCORE.dll (10.0.19041.1387) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000c06f0000 000000000003b000 C:\Windows\SYSTEM32\dxcore.dll (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 00000000bc660000 0000000000043000 C:\Windows\SYSTEM32\igdfcl64.dll (20.19.15.4531) (-exported- Symbols Loaded) File Version : 20.19.15.4531 Company Name : Intel Corporation Product Name : Intel HD Graphics Drivers for Windows(R) Product Version : 20.19.15.4531 ModLoad: 000000008c4f0000 0000000000577000 C:\Windows\SYSTEM32\igdmcl64.dll (20.19.15.4531) (-exported- Symbols Loaded) File Version : 20.19.15.4531 Company Name : Intel Corporation Product Name : Intel HD Graphics Drivers for Windows(R) Product Version : 20.19.15.4531 ModLoad: 00000000c7f30000 00000000001e4000 C:\Windows\SYSTEM32\dbghelp.dll (10.0.19041.867) (-exported- Symbols Loaded) File Version : 10.0.19041.867 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.867 ModLoad: 00000000b0070000 000000000006d000 C:\Windows\SYSTEM32\igdbcl64.dll (20.19.15.4531) (-exported- Symbols Loaded) File Version : 20.19.15.4531 Company Name : Intel Corporation Product Name : Intel HD Graphics Drivers for Windows(R) Product Version : 20.19.15.4531 ModLoad: 00000000bece0000 0000000000a40000 C:\Windows\SYSTEM32\igdusc64.dll (20.19.15.4531) (-exported- Symbols Loaded) File Version : 20.19.15.4531 Company Name : Intel Corporation Product Name : Intel HD Graphics Drivers for Windows(R) Product Version : 20.19.15.4531 ModLoad: 00000000ce050000 0000000000018000 C:\Windows\SYSTEM32\CRYPTSP.dll (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 00000000cd7e0000 0000000000034000 C:\Windows\system32\rsaenh.dll (10.0.19041.1052) (-exported- Symbols Loaded) File Version : 10.0.19041.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1 ModLoad: 00000000ce070000 000000000000c000 C:\Windows\SYSTEM32\CRYPTBASE.dll (10.0.19041.546) (-exported- Symbols Loaded) File Version : 10.0.19041.546 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.546 ModLoad: 00000000ced70000 0000000000082000 C:\Windows\System32\bcryptPrimitives.dll (10.0.19041.1415) (-exported- Symbols Loaded) File Version : 10.0.19041.1415 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.19041.1415

sudden6 commented 2 years ago

This crash might be because I made a mistake, which only the RX580 compiler seems to tolerate. See: https://github.com/llvm/llvm-project/issues/53436#issuecomment-1071618469 for further details. I'm working on a fixed version.

RomanKondratovich commented 2 years ago

Ok! i can test after fix :)

sudden6 commented 2 years ago

This should be fixed now on https://github.com/sudden6/m-queens/tree/gpu_recursive

Can you compile this on MSYS2 yourself? Otherwise I'll provide binaries in a few days.

RomanKondratovich commented 2 years ago

make: *** No rule to make target '../../static_boinc/lib64/libboinc.a', needed by 'm-queens-boinc'. Stop. win 10 + MSYS2

sudden6 commented 2 years ago

I think I was overeager with this, I need to cleanup the build system and merge it with some other patches, sorry for the confusion.

RomanKondratovich commented 2 years ago

Thanks sudden6 for you work! Now i can compile and test boinc-ocl to RX6600 XT and RTX3060. as I see an error somewhere in the generation, since 1n times a output file of 10M can be generated, and sometimes 2.5M, 1st time it was possible to generate 38M :) out1.zip out2.zip stderr.zip stderr_big.zip out_big.zip

sudden6 commented 2 years ago

@RomanKondratovich can you try and build the standalone m-queens program without boinc? It allows you to try to solve different board sizes stand alone which makes things more deterministic.

I checked the output you delivered, there doesn't seem to be any obvious problem. However I rebuilt my dev environment and found and fixed a bug that could affect you. I also added a new option to the m-queens2 program, so that you can enable and disable the debug printf from the OpenCL kernel at run time. So maybe you give it another try. I'm also experiencing problems for board sizes >= 13, without debug printf enabled, so maybe you can do a run from with N=11,12,13,14 and we can compare the outputs.

RomanKondratovich commented 2 years ago

Yess!! :) works great standalone and ocl version! now stderr: Platform name: AMD Accelerated Parallel Processing Platform version: OpenCL 2.1 AMD-APP (3354.13) Device: gfx1032 Device version: OpenCL 2.0 AMD-APP (3354.13) OpenCL 2.x supported Device memory: 8176MB Allocation limit: 6732MB

Aligning to SUM_REDUCTION_FACTOR: 20971520 OPTIONS: -cl-std=CL2.0 -DBOARDSIZE=27 -DGPU_DEPTH=11 -DWORKSPACE_SIZE=20971520 -DWORKGROUP_SIZE=64 -DSUM_REDUCTION_FACTOR=32768 -DDEBUG_PRINT=0 OpenCL build log:

OCL Kernel memory: 2560MB Threads: 1 Starting cleanup Launching reduction sum kernels, count: 640, workspace_size: 20971520, SUM_REDUCTION_FACTOR: 32768 Starting cleanup Launching reduction sum kernels, count: 640, workspace_size: 20971520, SUM_REDUCTION_FACTOR: 32768 Starting cleanup Launching reduction sum kernels, count: 640, workspace_size: 20971520, SUM_REDUCTION_FACTOR: 32768 Starting cleanup Launching reduction sum kernels, count: 640, workspace_size: 20971520, SUM_REDUCTION_FACTOR: 32768 Starting cleanup Launching reduction sum kernels, count: 640, workspace_size: 20971520, SUM_REDUCTION_FACTOR: 32768 Starting cleanup

RomanKondratovich commented 2 years ago

Nvidia test: m-queens2.exe -s 11 -m ocl -p 0 -d 0 Platform name: NVIDIA CUDA Platform version: OpenCL 3.0 CUDA 11.4.243 Device: NVIDIA GeForce RTX 3060 Device version: OpenCL 3.0 CUDA OpenCL 3.x supported Device memory: 12288MB Allocation limit: 3072MB

Allocation limit reached, truncating to: 26843545 Aligning to SUM_REDUCTION_FACTOR: 26836992 OPTIONS: -cl-std=CL2.0 -DBOARDSIZE=11 -DGPU_DEPTH=8 -DWORKSPACE_SIZE=26836992 -D WORKGROUP_SIZE=64 -DSUM_REDUCTION_FACTOR=32768 -DDEBUG_PRINT=0 OpenCL build log:

:25:16: error: variadic macros not supported in OpenCL #define printf(...) ^ program.build failed: -11 Solver init failed

sudden6 commented 2 years ago

Hmmm, you can probably workaround the Nvidia problem, by manually commenting out all printf lines, maybe I find a workaround for that^^

RomanKondratovich commented 2 years ago

Jep, but next error: m-queens2.exe -s 12 works great. m-queens2.exe -s 13: Platform name: NVIDIA CUDA Platform version: OpenCL 3.0 CUDA 11.4.243 Device: NVIDIA GeForce RTX 3060 Device version: OpenCL 3.0 CUDA OpenCL 3.x supported Device memory: 12288MB Allocation limit: 3072MB

Allocation limit reached, truncating to: 26843545 Aligning to SUM_REDUCTION_FACTOR: 26836992 OPTIONS: -cl-std=CL2.0 -DBOARDSIZE=13 -DGPU_DEPTH=10 -DWORKSPACE_SIZE=26836992 -DWORKGROUP_SIZE=64 -DSUM_REDUCTION_FACTOR=32768 -DDEBUG_PRINT=0 OpenCL build log: ptxas error : Entry function 'relaunch_kernel' uses too much shared data (0xc038 bytes, 0xc000 max) ptxas error : Entry function '__kernel___relaunch_kernel_block_invoke$34' uses too much shared data (0xc038 bytes, 0xc000 max)

sudden6 commented 2 years ago

Unfortunately I don't think that's something I can fix :(

I guess there's better luck with Intel and AMD. Is m-queens2 now fully working on your RX6600? no errors for all board sizes?

RomanKondratovich commented 2 years ago

Yes, AMD RX 6600 XT works great. Tasks completed 90, estimated time ~ 1h/boinc task

sudden6 commented 2 years ago

1h/boinc task seems quite long to me, my old RX580 does one task in ~10-20min, but I optimized for that card, soo....

Are the tasks confirmed by the server to return the correct results?

Can you share a screenshot of GPU-Z with the sensors page while running a task?

RomanKondratovich commented 2 years ago

Yes all task is confirmed. Gpu-z is attached. RX6600xt_opencl_1 RX6600xt_opencl_2 RX6600xt_opencl_3 RX6600xt_opencl_4 RX6600xt_opencl_5 RX6600xt_advanced RX6600xt_info RX6600xt_sensors

sudden6 commented 2 years ago

Thank you for the info!

It seems something is limiting the performance, because your card only draws 53W, but the GPU shows maximum load. Might need some fine tuning, maybe the workgroup size is not optimal for RDNA2 cards.

sudden6 commented 2 years ago

@RomanKondratovich You could try and set this constant: https://github.com/sudden6/m-queens/blob/gpu_recursive/clsolver.cpp#L31 to 32, I think that could help on your GPU.

RomanKondratovich commented 2 years ago

Changed WORKGROUP_SIZE from 64 to 32: Allocation limit: 6732MB

Aligning to SUM_REDUCTION_FACTOR: 20971520 OPTIONS: -cl-std=CL2.0 -DBOARDSIZE=27 -DGPU_DEPTH=11 -DWORKSPACE_SIZE=20971520 -DWORKGROUP_SIZE=32 -DSUM_REDUCTION_FACTOR=32768 -DDEBUG_PRINT=0 OpenCL build log:

OCL Kernel memory: 2560MB Threads: 1

Unhandled Exception Detected...

Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x00007ffad352b80c read attempt to address 0xC996E000

sudden6 commented 2 years ago

huh, that's not supposed to happen, need to investigate...

RomanKondratovich commented 2 years ago

i can share this pc to you over teamviewer or anydesk.

sudden6 commented 2 years ago

Thank you for the offer, but developing via TeamViewer or similar will probably not be efficient, as GPU hangs tend to crash the whole system... Also I don't really have time for a long development session :(

I'll try reproducing this on my systems first lets see what if it's reproducible first.

sudden6 commented 1 year ago

@RomanKondratovich I finally found some time to look into this further, I found some race conditions in the OpenCL code, maybe that fixes it?

RomanKondratovich commented 1 year ago

Great news! Great news! can you the build exe file?

sudden6 commented 1 year ago

I'll do as soon as possible, might be one or two days though.

sudden6 commented 1 year ago

@RomanKondratovich a release is published, please tell me if you detect any regressions.

sudden6 / m-queens

RX 6600 XT #2