Open maharshi365 opened 6 years ago
I'm having the same issue. AMD config changes don't change the behavior unless platform index is other than 0, but 1 doesn't exist if I do that. Using Dual Xeon CPU and Dual HD 6970 Version: xmr-stak/2.1.0/30888a3/master/win/nvidia-amd-cpu/aeon-monero/20
/*
GPU configuration. You should play around with intensity and worksize as the fastest settings will vary. index - GPU index number usually starts from 0 intensity - Number of parallel GPU threads (nothing to do with CPU threads) worksize - Number of local GPU threads (nothing to do with CPU threads) affine_to_cpu - This will affine the thread to a CPU. This can make a GPU miner play along nicer with a CPU miner. strided_index - switch memory pattern used for the scratch pad memory true = use 16byte contiguous memory per thread, the next memory block has offset of intensity blocks false = use a contiguous block of memory per thread "gpu_threads_conf" : [ { "index" : 1, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false, "strided_index" :true }, ], */ "gpu_threads_conf" : [ // gpu: Cayman memory:384 // compute units: 24 { "index" : 0, "intensity" : 0, "worksize" : 8, "affine_to_cpu" : false, "strided_index" : false }, // gpu: Cayman memory:384 // compute units: 24 { "index" : 1, "intensity" : 0, "worksize" : 8, "affine_to_cpu" : false, "strided_index" : false },
],
/*
Platform index. This will be 0 unless you have different OpenCL platform - eg. AMD and Intel. */ "platform_index" : 1, CLINFO: Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 2.0 AMD-APP (1800.8) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices
Platform Name: AMD Accelerated Parallel Processing Number of devices: 2 Device Type: CL_DEVICE_TYPE_GPU Vendor ID: 1002h Board name: AMD Radeon HD 6900 Series Device Topology: PCI[ B#2, D#0, F#0 ] Max compute units: 24 Max work items dimensions: 3 Max work items[0]: 256 Max work items[1]: 256 Max work items[2]: 256 Max work group size: 256 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 4 Preferred vector width double: 2 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 4 Native vector width double: 2 Max clock frequency: 880Mhz Address bits: 32 Max memory allocation: 536870912 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 8 Max image 2D width: 16384 Max image 2D height: 16384 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 1024 Alignment (bits) of base address: 2048 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: No Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: None Cache line size: 0 Cache size: 0 Global memory size: 2147483648 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Scratchpad Local memory size: 32768 Max pipe arguments: 0 Max pipe active reservations: 0 Max pipe packet size: 0 Max global variable size: 0 Max global variable preferred total size: 0 Max read/write image args: 0 Max on device events: 0 Queue on device max size: 0 Max on device queues: 0 Queue on device preferred size: 0 SVM capabilities: Coarse grain buffer: No Fine grain buffer: No Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 64 Error correction support: 0 Unified memory for Host and Device: 0 Profiling timer resolution: 1 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: No Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: No Profiling : No Platform ID: 00007FFDC042F180 Name: Cayman Vendor: Advanced Micro Devices, Inc. Device OpenCL C version: OpenCL C 1.2 Driver version: 1800.8 (VM) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1800.8) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_image2d_from_buffer_read_only cl_khr_spir cl_khr_gl_event
Device Type: CL_DEVICE_TYPE_CPU Vendor ID: 1002h Board name: Max compute units: 16 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 4 Preferred vector width double: 2 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 4 Native vector width double: 2 Max clock frequency: 2267Mhz Address bits: 64 Max memory allocation: 12882603008 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 4096 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 32768 Global memory size: 51530412032 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Global Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 4292668416 Max global variable size: 1879048192 Max global variable preferred total size: 1879048192 Max read/write image args: 64 Max on device events: 0 Queue on device max size: 0 Max on device queues: 0 Queue on device preferred size: 0 SVM capabilities: Coarse grain buffer: No Fine grain buffer: No Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 1 Error correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 451 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: Yes Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: No Profiling : No Platform ID: 00007FFDC042F180 Name: Intel(R) Xeon(R) CPU E5520 @ 2.27GHz Vendor: GenuineIntel Device OpenCL C version: OpenCL C 1.2 Driver version: 1800.8 (sse2) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1800.8) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_spir cl_khr_gl_event
Hi i seem to have a common problem. where i get Error CL_DEVICE_NOT_FOUND when calling clGetDeviceIDs
Here is my CL info
Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 2.0 AMD-APP (1912.5) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices
Platform Name: AMD Accelerated Parallel Processing Number of devices: 1 Device Type: CL_DEVICE_TYPE_CPU Vendor ID: 1002h Board name: Max compute units: 8 Max work items dimensions: 3 Max work items[0]: 1024 Max work items[1]: 1024 Max work items[2]: 1024 Max work group size: 1024 Preferred vector width char: 16 Preferred vector width short: 8 Preferred vector width int: 4 Preferred vector width long: 2 Preferred vector width float: 8 Preferred vector width double: 4 Native vector width char: 16 Native vector width short: 8 Native vector width int: 4 Native vector width long: 2 Native vector width float: 8 Native vector width double: 4 Max clock frequency: 3998Mhz Address bits: 64 Max memory allocation: 4279986176 Image support: Yes Max number of images read arguments: 128 Max number of images write arguments: 64 Max image 2D width: 8192 Max image 2D height: 8192 Max image 3D width: 2048 Max image 3D height: 2048 Max image 3D depth: 2048 Max samplers within kernel: 16 Max size of kernel argument: 4096 Alignment (bits) of base address: 1024 Minimum alignment (bytes) for any datatype: 128 Single precision floating point capability Denorms: Yes Quiet NaNs: Yes Round to nearest even: Yes Round to zero: Yes Round to +ve and infinity: Yes IEEE754-2008 fused multiply-add: Yes Cache type: Read/Write Cache line size: 64 Cache size: 32768 Global memory size: 17119944704 Constant buffer size: 65536 Max number of constant args: 8 Local memory type: Global Local memory size: 32768 Max pipe arguments: 16 Max pipe active reservations: 16 Max pipe packet size: 4279986176 Max global variable size: 1879048192 Max global variable preferred total size: 1879048192 Max read/write image args: 64 Max on device events: 0 Queue on device max size: 0 Max on device queues: 0 Queue on device preferred size: 0 SVM capabilities: Coarse grain buffer: No Fine grain buffer: No Fine grain system: No Atomics: No Preferred platform atomic alignment: 0 Preferred global atomic alignment: 0 Preferred local atomic alignment: 0 Kernel Preferred work group size multiple: 1 Error correction support: 0 Unified memory for Host and Device: 1 Profiling timer resolution: 256 Device endianess: Little Available: Yes Compiler available: Yes Execution capabilities: Execute OpenCL kernels: Yes Execute native function: Yes Queue on Host properties: Out-of-Order: No Profiling : Yes Queue on Device properties: Out-of-Order: No Profiling : No Platform ID: 00007FFFB8CCE188 Name: Intel(R) Core(TM) i7-4790K CPU @ 4.00GHz Vendor: GenuineIntel Device OpenCL C version: OpenCL C 1.2 Driver version: 1912.5 (sse2,avx) Profile: FULL_PROFILE Version: OpenCL 1.2 AMD-APP (1912.5) Extensions: cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_spir cl_khr_gl_event
and My config
/*
Number of GPUs that you have in your system. Each GPU will get its own CPU thread. / "gpu_thread_num" : 6, /
GPU configuration. You should play around with intensity and worksize as the fastest settings will vary. index - GPU index number usually starts from 0 intensity - Number of parallel GPU threads (nothing to do with CPU threads) worksize - Number of local GPU threads (nothing to do with CPU threads) affine_to_cpu - This will affine the thread to a CPU. This can make a GPU miner play along nicer with a CPU miner. / "gpu_threads_conf" : [ { "index" : 0, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false }, { "index" : 1, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false }, { "index" : 2, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false }, { "index" : 3, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false }, { "index" : 4, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false }, { "index" : 5, "intensity" : 1000, "worksize" : 8, "affine_to_cpu" : false }, ], /
Platform index. This will be 0 unless you have different OpenCL platform - eg. AMD and Intel. / "platform_index" : 0, /
TLS Settings If you need real security, make sure tls_secure_algo is enabled (otherwise MITM attack can downgrade encryption to trivially breakable stuff like DES and MD5), and verify the server's fingerprint through a trusted channel. use_tls - This option will make us connect using Transport Layer Security. tls_secure_algo - Use only secure algorithms. This will make us quit with an error if we can't negotiate a secure algo. tls_fingerprint - Server's SHA256 fingerprint. If this string is non-empty then we will check the server's cert against it. / "use_tls" : false, "tls_secure_algo" : true, "tls_fingerprint" : "", /
pool_address - Pool address should be in the form "pool.supportxmr.com:3333". Only stratum pools are supported. wallet_address - Your wallet, or pool login. pool_password - Can be empty in most cases or "x". / "pool_address" : "pool.supportxmr.com:3333", "wallet_address" : "", "pool_password" : "", /
Network timeouts. Because of the way this client is written it doesn't need to constantly talk (keep-alive) to the server to make sure it is there. We detect a buggy / overloaded server by the call timeout. The default values will be ok for nearly all cases. If they aren't the pool has most likely overload issues. Low call timeout values are preferable - long timeouts mean that we waste hashes on potentially stale jobs. Connection report will tell you how long the server usually takes to process our calls. call_timeout - How long should we wait for a response from the server before we assume it is dead and drop the connection. retry_time - How long should we wait before another connection attempt. Both values are in seconds. giveup_limit - Limit how many times we try to reconnect to the pool. Zero means no limit. Note that stak miners don't mine while the connection is lost, so your computer's power usage goes down to idle. */ "call_timeout" : 10, "retry_time" : 10, "giveup_limit" : 0,
/*
Output control. Since most people are used to miners printing all the time, that's what we do by default too. This is suboptimal really, since you cannot see errors under pages and pages of text and performance stats. Given that we have internal performance monitors, there is very little reason to spew out pages of text instead of concise reports. Press 'h' (hashrate), 'r' (results) or 'c' (connection) to print reports. verbose_level - 0 - Don't print anything. 1 - Print intro, connection event, disconnect event 2 - All of level 1, and new job (block) event if the difficulty is different from the last job 3 - All of level 1, and new job (block) event in all cases, result submission event. 4 - All of level 3, and automatic hashrate report printing */ "verbose_level" : 3,
/*
Automatic hashrate report h_print_time - How often, in seconds, should we print a hashrate report if verbose_level is set to 4. This option has no effect if verbose_level is not 4. */ "h_print_time" : 60,
/*
Daemon mode If you are running the process in the background and you don't need the keyboard reports, set this to true. This should solve the hashrate problems on some emulated terminals. / "daemon_mode" : false, /
Output file output_file - This option will log all output to a file. */ "output_file" : "",
/*
Built-in web server I like checking my hashrate on my phone. Don't you? Keep in mind that you will need to set up port forwarding on your router if you want to access it from outside of your home network. Ports lower than 1024 on Linux systems will require root. httpd_port - Port we should listen on. Default, 0, will switch off the server. / "httpd_port" : 0, /
prefer_ipv4 - IPv6 preference. If the host is available on both IPv4 and IPv6 net, which one should be choose? This setting will only be needed in 2020's. No need to worry about it now. */ "prefer_ipv4" : true,
Any help would be awesome