Closed xmronadaily closed 5 years ago
I'm mining BitTube using XMRSTAK 2.7.1, with AMD Driver 18.5.1, Vega 64 Card, no issues.
I'm using dual intensity of 976. I may be able to go higher, but this works. If you're using a higher intensity, try this as a starting point.
Hmmm, that is really strange, but really good to know! I've even used lower intensity of dual 960, but it would give invalid shares no matter what... Could you please tell me what other parameters you have in your amd config? And also are your cards running at 1100 memory or stock?
{ "index" : 1, "intensity" : 976, "worksize" : 16, "affine_to_cpu" : false, "strided_index" : 2, "mem_chunk" : 2, "unroll" : 2, "comp_mode" : false },
{ "index" : 1, "intensity" : 976, "worksize" : 16, "affine_to_cpu" : false, "strided_index" : 2, "mem_chunk" : 2, "unroll" : 2, "comp_mode" : false },
Card: Mem_P3=1100;905 / GPU_P7=1408;905
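For anyone copying these settings: the two entries above are identical and both point at the same GPU index, which appears to be what "dual" intensity refers to here (two threads on one card). A minimal Python sketch that produces such a pair of entries follows; the helper names `dual_thread_entries` and `render` are made up for illustration and are not part of xmr-stak.

```python
import json


def dual_thread_entries(index, intensity=976, worksize=16, threads=2):
    """Return `threads` identical config entries for one GPU index.

    The field values mirror the settings posted in this thread.
    """
    entry = {
        "index": index,
        "intensity": intensity,
        "worksize": worksize,
        "affine_to_cpu": False,
        "strided_index": 2,
        "mem_chunk": 2,
        "unroll": 2,
        "comp_mode": False,
    }
    # Copy the dict so each thread entry can be edited independently later.
    return [dict(entry) for _ in range(threads)]


def render(entries):
    """Render one entry per line with a trailing comma, as in the post above."""
    return "\n".join(json.dumps(e) + "," for e in entries)


if __name__ == "__main__":
    print(render(dual_thread_entries(1)))
```

The output still has to be pasted by hand into the thread list of your amd.txt; the sketch only saves retyping the same entry twice.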
Okay, thank you so much! I'm in 18.6.1 atm, I'll try reverting to 18.5.1 and with your settings, I'll update how it goes! Also I'm running same core and memory clock as you, so no deviation there :)
No probs, I've had problems with 18.6.1, so have stuck with 18.5.1. Of course, update here, and close if fixed for you, so it saves the devs time with all issues :)
Also if you fancy trying a smaller pool, please check out https://tube.pool.gntl.co.uk
@BKdilse So I just tried it, with your amd config setting and 18.5.1 driver, but no luck, invalid shares again... But the thing is, it was working fine until I did a windows reinstall a couple of days ago, so it has to be something with files in .openclcache not getting compiled properly.
Is there any way that you could please go to your C:\Users\youruserhere\AppData\Local\.openclcache and send me over the openclbin files you have there? I'd copy-paste them into my folder and try that as a last resort; other than that, I don't know what to do... Thank you so much for your input here!
You have been clearing your opencl cache files between driver changes?
@unsivilaudio Yeah, every time. Tried on every possible version with clear cache, and miner would recompile the bins, but it never fixed the invalid shares...
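Clearing the cache between driver changes can be scripted. The following is a rough Python sketch, assuming the cache lives in the `.openclcache` folder under `AppData\Local` as mentioned earlier in this thread; the function name `clear_opencl_cache` and the `DEFAULT_CACHE` constant are hypothetical.

```python
from pathlib import Path

# Assumed default location, based on the path mentioned in this thread.
DEFAULT_CACHE = Path.home() / "AppData" / "Local" / ".openclcache"


def clear_opencl_cache(cache_dir=DEFAULT_CACHE):
    """Delete every file in the cache folder and return the removed names."""
    cache_dir = Path(cache_dir)
    removed = []
    if not cache_dir.is_dir():
        # Nothing to clear, e.g. the miner has not compiled anything yet.
        return removed
    for item in sorted(cache_dir.iterdir()):
        if item.is_file():
            item.unlink()
            removed.append(item.name)
    return removed


if __name__ == "__main__":
    for name in clear_opencl_cache():
        print("removed", name)
```

The miner should then recompile its binaries on the next start, which is the behaviour described above.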
I'll give it a shot, I'm on 18.6.1 currently.
Yeah working fine for me, I even deleted my cache and let it rebuild.
2 threads, 896 intensity, worksize 16, strided_index 2, mem_chunk 16
Yes, I always clear my cache with driver/xmr-stak updates.
No idea what's wrong then, what windows version are you on?
Can you please share your openclcache files with me? I'd really appreciate it if it's a way to get it working :)
Some great news at last! @BKdilse You sir, saved my life! So, Vegas are finally submitting valid shares on CN heavy!!! 100% acceptance. But what I had to do was the following, I took your openclbin and pasted it in my folder, but when I ran the miner, it still started compiling its own file, and then it submitted invalid shares again...
But then I went back to the openclcache folder and renamed your file to fit the name of the file the miner just compiled:
I renamed "eb80a01cc8fe1c292765149194af6ee11bf5c990404dcd17c7fbf80241db3867" to "766d71af38aa097ade9860cf4638d2a5b2106eab8db3f958e6690bae6c23bdd7"
And now when I ran it, it was actually selecting your file and works just fine!
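The rename workaround above can be written down as a small Python sketch. It assumes only what the thread shows: the miner picks up a cached binary by its file name, so a known-good file saved under the name of the freshly compiled one gets loaded instead. The function name `substitute_binary` is hypothetical, and the sketch keeps the faulty file under a `.bad` suffix rather than deleting it.

```python
import shutil
from pathlib import Path


def substitute_binary(cache_dir, good_name, compiled_name):
    """Put a copy of a known-good binary in place of the locally compiled one."""
    cache_dir = Path(cache_dir)
    good = cache_dir / good_name
    target = cache_dir / compiled_name
    if not good.is_file():
        raise FileNotFoundError(good)
    if target.exists():
        # Keep the faulty binary around for later comparison.
        target.replace(target.with_name(target.name + ".bad"))
    shutil.copy2(good, target)
    return target
```

In the case above, `good_name` would be the file starting with `eb80a01c...` and `compiled_name` the one starting with `766d71af...`. Copying instead of renaming leaves the original donor file untouched as a backup.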
I'm just happy it got fixed and thank you so much for your help and contribution and sending me that crucial bin! I'm backing up that and saving it like gold! :D
That's a really weird issue, but glad you have it sorted. No problems.
Yeah this says to me you aren't fully uninstalling the bad drivers. You should use DDU in safe mode.
Oh but I was, every time I ran ddu in safe mode, unless it was somehow fucking up and windows was maybe at fault... I had this issue before actually https://github.com/fireice-uk/xmr-stak/issues/2124
There I did the same thing: I ran DDU in safe mode and installed 18.6.1, and then it worked fine. I was doing the same thing now, but on a different Windows install, and it wasn't working; luckily those bins BKdilse sent worked... Don't know what caused it. The only variable left is the Windows version: the other one was some older cracked version from TPB (don't know what edition) and the current one is Home, downloaded from Microsoft's site. Really weird, this whole thing...
@BKdilse Did you try running your cards for CN heavy at worksize of 8? It should give a small hashrate bump, as well as memchunk, I think going 10 for memchunk up from 2 also increases hashrate. It does for me, but I can't use those settings since it's instant invalid shares 'cause the only working bins are yours :D
@xmronadaily I've played with various settings in the past, and settled with the ones I have. I'll give it another go though :)
Yeap! Do check it out and see if there's an improvement for you :)
Resolved the issue:
Reinstalled Windows again, installed the 18.5.1 drivers, and made sure that only the Vega cards were listed in the amd.txt config while the binaries were compiling; running them in conjunction with other cards produces faulty binaries for the Vegas. After good binaries are compiled, you can add the other cards to the amd config file and the Vegas still run fine.
I tested this on 18.3.4 as well.
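The two-stage approach described here (Vegas only for the first compile, everything afterwards) could be sketched as follows in Python. This assumes the thread entries have already been read into a list of dicts with an `index` field; the function name `split_for_first_compile` and the GPU indices in the example are hypothetical and will differ per rig.

```python
def split_for_first_compile(entries, vega_indices):
    """Split thread entries into (vega_only, others) by GPU index.

    Run the miner with only `vega_only` until the binaries are compiled,
    then add `others` back to the config.
    """
    vega_only = [e for e in entries if e["index"] in vega_indices]
    others = [e for e in entries if e["index"] not in vega_indices]
    return vega_only, others


if __name__ == "__main__":
    # Hypothetical layout: indices 0 and 1 are other cards, 2 and 3 are Vegas.
    entries = [
        {"index": 0, "intensity": 1000},
        {"index": 1, "intensity": 1000},
        {"index": 2, "intensity": 976},
        {"index": 2, "intensity": 976},
        {"index": 3, "intensity": 976},
        {"index": 3, "intensity": 976},
    ]
    first_run, add_later = split_for_first_compile(entries, {2, 3})
    print(len(first_run), "entries for the first run,", len(add_later), "to add later")
```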
Please provide as much as possible information to reproduce the issue.
Basic information
RX Vega 64 giving invalid shares on all drivers 18.3.4, 18.5.1, 18.5.2, 18.6.1 when mining CN Heavy algos. (Haven, bittube)
This seems to be only specific to Vega cards as I have r9 290x cards running on CN heavy and all shares are valid for them.
Other than that, all other algos are fine for both vega and r9 290x cards (CN2, cn-fast, etc.... )
One note however: I had this working properly somehow before, up until yesterday when I reinstalled windows. So, the openclcache was somehow compiled good previously in all the drivers that I used, and I had no problems with Vegas on CN heavy, but now after Windows reinstall and HDD format, it just doesn't work...
Compile issues
Used ready binaries and compiled myself, no difference.
AMD OpenCl issue
run clinfo and add the output here
Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 2.1 AMD-APP (2580.6) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices
Platform Name: AMD Accelerated Parallel Processing Number of devices: 5 Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: Radeon RX Vega Device Topology: PCI[ B#3, D#0, F#0 ] Max compute units: 44 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 1500Mhz Address bits: 64 Max memory allocation: 3422552064 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 4294967296 Constant buffer size: 3422552064 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 3422552064 Max global variable size: 3080296704 Max global variable preferred total size: 4294967296 Max read/write image args: 64 Max on device events: 1024 Queue on device max size: 8388608 Max on device queues: 1 Queue on device preferred size: 262144 SVM capabilities: Coarse grain buffer: Yes Fine grain buffer: Yes Fine grain system: No Atomics: No Preferred 
platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: Yes Profiling : Yes Platform ID: 00007FFCF7E7FFD0 Name: Hawaii Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 2.0 Driver version: 2580.6 Profile: FULL_PROFILE Version: OpenCL 2.0 AMD-APP (2580.6) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_khr_gl_depth_images cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_amd_liquid_flash
Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: Radeon RX Vega Device Topology: PCI[ B#4, D#0, F#0 ] Max compute units: 44 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 1500Mhz Address bits: 64 Max memory allocation: 3422552064 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 4294967296 Constant buffer size: 3422552064 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 3422552064 Max global variable size: 3080296704 Max global variable preferred total size: 4294967296 Max read/write image args: 64 Max on device events: 1024 Queue on device max size: 8388608 Max on device queues: 1 Queue on device preferred size: 262144 SVM capabilities: Coarse grain buffer: Yes Fine grain buffer: Yes Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred 
local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: Yes Profiling : Yes Platform ID: 00007FFCF7E7FFD0 Name: Hawaii Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 2.0 Driver version: 2580.6 Profile: FULL_PROFILE Version: OpenCL 2.0 AMD-APP (2580.6) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_khr_gl_depth_images cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_amd_liquid_flash
Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: Radeon RX Vega Device Topology: PCI[ B#15, D#0, F#0 ] Max compute units: 64 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 1408Mhz Address bits: 64 Max memory allocation: 4244635648 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 8573157376 Constant buffer size: 4244635648 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 4244635648 Max global variable size: 3820172032 Max global variable preferred total size: 8573157376 Max read/write image args: 64 Max on device events: 1024 Queue on device max size: 8388608 Max on device queues: 1 Queue on device preferred size: 262144 SVM capabilities: Coarse grain buffer: Yes Fine grain buffer: Yes Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred 
local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: Yes Profiling : Yes Platform ID: 00007FFCF7E7FFD0 Name: gfx900 Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 2.0 Driver version: 2580.6 (PAL,HSAIL) Profile: FULL_PROFILE Version: OpenCL 2.0 AMD-APP (2580.6) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_khr_gl_depth_images cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_amd_liquid_flash
Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: Radeon RX Vega Device Topology: PCI[ B#11, D#0, F#0 ] Max compute units: 64 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 256 Preferred vector width char: 4 Preferred vector width short: 2 Preferred vector width int: 1 Preferred vector width long: 1 Preferred vector width float: 1 Preferred vector width double: 1 Native vector width char: 4 Native vector width short: 2 Native vector width int: 1 Native vector width long: 1 Native vector width float: 1 Native vector width double: 1 Max clock frequency: 1408Mhz Address bits: 64 Max memory allocation: 4244635648 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 8573157376 Constant buffer size: 4244635648 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 4244635648 Max global variable size: 3820172032 Max global variable preferred total size: 8573157376 Max read/write image args: 64 Max on device events: 1024 Queue on device max size: 8388608 Max on device queues: 1 Queue on device preferred size: 262144 SVM capabilities: Coarse grain buffer: Yes Fine grain buffer: Yes Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred 
local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: Yes Profiling : Yes Platform ID: 00007FFCF7E7FFD0 Name: gfx900 Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 2.0 Driver version: 2580.6 (PAL,HSAIL) Profile: FULL_PROFILE Version: OpenCL 2.0 AMD-APP (2580.6) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_khr_gl_depth_images cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_amd_liquid_flash
Device Type: CL_DEVICE_TYPE_CPU Vendor ID: 1002h Board name: Max compute units: 6 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 8 Preferred vector width double: 4 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 8 Native vector width double: 4 Max clock frequency: 3793Mhz Address bits: 64 Max memory allocation: 3194386432 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 4096 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 16384 Global memory size: 12777545728 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Global Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 3194386432 Max global variable size: 1879048192 Max global variable preferred total size: 1879048192 Max read/write image args: 64 Max on device events: 0 Queue on device max size: 0 Max on device queues: 0 Queue on device preferred size: 0 SVM capabilities: Coarse grain buffer: No Fine grain buffer: No Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 1 Error 
correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 100 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: Yes Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: No Profiling : No Platform ID: 00007FFCF7E7FFD0 Name: AMD FX(tm)-6300 Six-Core Processor Vendor: AuthenticAMD Device OpenCL C version: OpenCL C 1.2 Driver version: 2580.6 (sse2,avx,fma4) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (2580.6) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_spir cl_khr_gl_event
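Since the dump above is hard to read, here is a rough Python sketch that condenses clinfo text into one line per device: device name, compute units, and global memory size. It relies only on field labels that appear in the output above (`Max compute units`, `Global memory size`, `Platform ID`, `Name`, `Vendor`) and on their order there; other clinfo versions may print things differently, so treat the pattern as an assumption. The names `DEVICE_RE` and `summarize_devices` are hypothetical.

```python
import re

# Matches one device block, using the field order seen in the dump above.
DEVICE_RE = re.compile(
    r"Max compute units:\s+(?P<cus>\d+)"
    r".*?Global memory size:\s+(?P<mem>\d+)"
    r".*?Platform ID:\s+\S+\s+Name:\s+(?P<name>.+?)\s+Vendor:",
    re.DOTALL,
)


def summarize_devices(clinfo_text):
    """Return a list of (name, compute_units, global_memory_bytes) tuples."""
    return [
        (m["name"], int(m["cus"]), int(m["mem"]))
        for m in DEVICE_RE.finditer(clinfo_text)
    ]


if __name__ == "__main__":
    import sys

    # Usage: clinfo | python this_script.py
    for name, cus, mem in summarize_devices(sys.stdin.read()):
        print(f"{name}: {cus} compute units, {mem / 2**30:.1f} GiB")
```

Applied to the output above, this kind of summary makes it easier to see that the dump lists two devices named Hawaii with 44 compute units, two named gfx900 with 64 compute units, and the CPU, even though all four GPUs report the same board name.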