Open SChernykh opened 5 years ago
GPU tests
AMD Radeon RX 560 on Windows 10 1809: core @ 1196 MHz, memory @ 2200 MHz, 1 Click PBE Timing Straps, 18.6.1 drivers, monitor plugged in, 2 threads (intensity 512, worksize 32, strided_index 2, mem_chunk 2, unroll 8):
AMD Vega 64 (stock, 2 threads, intensity 1920, worksize 16, strided_index 2, mem_chunk 2, unroll_factor 8):
CPU tests
AMD Ryzen 5 2600 @ 4 GHz, single thread:
I've been experimenting with Ryzen a lot, and this variance doesn't come from random math directly, it's somehow related to subtle instruction decoding stalls that happen for some code sequences. But I know how to fix it, it'll just take some more time.
Intel Pentium G5400 (Coffee Lake 3.7 GHz, 2 threads):
Intel Core i7-2600 (Sandy Bridge 3.4 GHz, 4 threads):
Dual AMD Opteron 6276 (16 physical cores, 24 threads)
Fluctuations are much smaller now compared to the first version.
CN/2 [2019-01-12 07:48:44] speed 10s/60s/15m 298.4 267.5 n/a H/s max 312.7 H/s | THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s | | 0 | -1 | 75.0 | 66.7 | n/a | | 1 | -1 | 75.7 | 67.3 | n/a | | 2 | -1 | 74.6 | 66.6 | n/a | | 3 | -1 | 75.6 | 67.3 | n/a |
CN/R
[2019-01-12 07:42:03] speed 10s/60s/15m 293.8 222.2 n/a H/s max 294.9 H/s | THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s | | 0 | -1 | 74.3 | 60.9 | n/a | | 1 | -1 | 74.2 | 60.8 | n/a | | 2 | -1 | 71.8 | 59.6 | n/a | | 3 | -1 | 73.3 | 69.3 | n/a |
model name : Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz (4 cores, 8 threads)
Did you test it for only 1 minute? You probably hit one of the slowest code sequences. Core 3 is also 9 H/s faster than other 3 cores which shouldn't happen - something was probably running in the background and spoiled test result.
Intel(R) Core(TM) i5-8250U CPU @ 1.60GHz (1) x64 AES
CN/2
[2019-01-12 18:46:13] new job from pool.wowne.ro:3333 diff 1080 algo cn/2
| THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s |
| 0 | 0 | 16.9 | 16.3 | 16.7 |
| 1 | 0 | 17.0 | 16.4 | 16.8 |
| 2 | 0 | 16.9 | 16.4 | 16.7 |
[2019-01-12 18:46:14] speed 10s/60s/15m 50.7 49.0 50.2 H/s max 50.9 H/s
CN/R
[2019-01-12 19:06:49] new job from testnet.wowne.ro:3333 diff 1950 algo cn/r height 350
| THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s |
| 0 | 0 | 17.9 | 16.5 | 17.2 |
| 1 | 0 | 18.0 | 16.5 | 17.2 |
| 2 | 0 | 17.9 | 16.5 | 17.2 |
[2019-01-12 19:06:50] speed 10s/60s/15m 53.8 49.4 51.7 H/s max 54.0 H/s
Intel® Core™ i5-5250U CPU @ 1.60GHz × 4
Hello, I am trying to provide some feedback for testing but always get this error...
[2019-01-12 12:01:29] new job from testnet.wowne.ro:3333 diff 5001 algo cn/r height 354 [2019-01-12 12:01:29] rejected (0/1) diff 5000 "Low difficulty share" (569 ms)
[2019-01-12 12:03:29] speed 10s/60s/15m 60.3 56.7 n/a H/s max 62.2 H/s [2019-01-12 12:03:30] new job from testnet.wowne.ro:3333 diff 1600 algo cn/r height 354 [2019-01-12 12:03:30] rejected (0/3) diff 1599 "Low difficulty share" (507 ms) [2019-01-12 12:03:30] speed 10s/60s/15m 60.1 56.8 n/a H/s max 62.2 H/s
Does testnet always reject shares? Just let me know... I have a few cpu's i can test and provide feedback
@jorgealonso108 It shouldn't reject shares. Did you compile the latest version?
Edit: I've compiled it using GCC 7.3.0 on Ubuntu 18.04 and GCC 8.2.1 on Windows 10 (MSYS2 build) and both builds worked fine, all shares were accepted.
I'm on Ubuntu 18.04 clean/fresh install....I have used the same version miner "XMRig/2.8.5-dev gcc/7.3.0" on other pools and have had no problems, its actually the first time I have seen that error...I don't really know what it means, but i would like to help fix the problem. Maybe latency to the pool? i.e. (569 ms) (507 ms) I can try it on a few cloud vM's closer to the pool? Maybe pool software? I do mine wownero and monero, cpu's only...let me know what i can do to help?
@jorgealonso108 How exactly did you compile it? Did you do
git clone https://github.com/SChernykh/xmrig.git
cd xmrig
git checkout CryptonightR
mkdir build
cd build
cmake ..
make
???
No...I compiled it from download zip folder...not with git clone I get this error with the above instructions...git checkout error?
ubuntu@ubuntu:~$ git clone https://github.com/SChernykh/xmrig.git Cloning into 'xmrig'... remote: Enumerating objects: 59, done. remote: Counting objects: 100% (59/59), done. remote: Compressing objects: 100% (29/29), done. remote: Total 5795 (delta 35), reused 41 (delta 30), pack-reused 5736 Receiving objects: 100% (5795/5795), 1.81 MiB | 1.24 MiB/s, done. Resolving deltas: 100% (4256/4256), done. ubuntu@ubuntu:~$ git checkout CryptonightR fatal: not a git repository (or any of the parent directories): .git ubuntu@ubuntu:~$
Please, if you can send me git clone exact instructions...I will follow them exactly. I was use xmrig, love it, never have a problem. I must be doing something wrong?
[ 86%] Building CXX object CMakeFiles/xmrig.dir/src/crypto/CryptonightR_gen.cpp.o
In file included from /home/ubuntu/Downloads/xmrig-CryptonightR/src/crypto/CryptonightR_gen.cpp:2:0:
/home/ubuntu/Downloads/xmrig-CryptonightR/src/crypto/variant4_random_math.h:182:12: warning: ‘int v4_random_math_init(V4_Instruction, uint64_t)’ defined but not used [-Wunused-function]
static int v4_random_math_init(struct V4_Instruction code, const uint64_t height)
^~~~~~~
[ 88%] Building CXX object CMakeFiles/xmrig.dir/src/common/log/SysLog.cpp.o
[ 90%] Building CXX object CMakeFiles/xmrig.dir/src/api/Api.cpp.o
cc1plus: warning: unrecognized command line option ‘-Wno-class-memaccess’
[ 91%] Building CXX object CMakeFiles/xmrig.dir/src/api/ApiRouter.cpp.o
Not trying to waste your time, just trying to help.
@jorgealonso108 I edited my post above: cd xmrig
shoud be done before git checkout
- you can try again.
Same problem as before...following these instructions exactly: git clone https://github.com/SChernykh/xmrig.git cd xmrig git checkout CryptonightR mkdir build cd build cmake .. make
ubuntu@ubuntu:~$ git clone https://github.com/SChernykh/xmrig.git
Cloning into 'xmrig'...
remote: Enumerating objects: 59, done.
remote: Counting objects: 100% (59/59), done.
remote: Compressing objects: 100% (29/29), done.
remote: Total 5795 (delta 35), reused 41 (delta 30), pack-reused 5736
Receiving objects: 100% (5795/5795), 1.81 MiB | 2.76 MiB/s, done.
Resolving deltas: 100% (4256/4256), done.
ubuntu@ubuntu:~$ cd xmrig
ubuntu@ubuntu:~/xmrig$ git checkout CryptonightR
Branch 'CryptonightR' set up to track remote branch 'CryptonightR' from 'origin'.
Switched to a new branch 'CryptonightR'
ubuntu@ubuntu:~/xmrig$ mkdir build
ubuntu@ubuntu:~/xmrig$ cd build
ubuntu@ubuntu:~/xmrig/build$ cmake ..
-- The C compiler identification is GNU 7.3.0
-- The CXX compiler identification is GNU 7.3.0
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Found UV: /usr/lib/x86_64-linux-gnu/libuv.a
-- Found OpenSSL: /usr/lib/x86_64-linux-gnu/libcrypto.so (found version "1.1.0g")
-- The ASM compiler identification is GNU
-- Found assembler: /usr/bin/cc
-- Looking for syslog.h
-- Looking for syslog.h - found
-- Found MHD: /usr/lib/x86_64-linux-gnu/libmicrohttpd.so
-- Configuring done
-- Generating done
-- Build files have been written to: /home/ubuntu/xmrig/build
ubuntu@ubuntu:~/xmrig/build$ make
Scanning dependencies of target xmrig-asm
Scanning dependencies of target xmrig-asm
[ 1%] Building ASM object CMakeFiles/xmrig-asm.dir/src/crypto/asm/cnv2_main_loop.S.o
[ 3%] Building ASM object CMakeFiles/xmrig-asm.dir/src/crypto/asm/CryptonightR_template.S.o
[ 4%] Building ASM object CMakeFiles/xmrig-asm.dir/src/crypto/asm/CryptonightR_64_template.S.o
All good till it gets here...
[ 85%] Building C object CMakeFiles/xmrig.dir/src/crypto/c_skein.c.o
[ 86%] Building CXX object CMakeFiles/xmrig.dir/src/crypto/CryptonightR_gen.cpp.o
In file included from /home/ubuntu/xmrig/src/crypto/CryptonightR_gen.cpp:2:0:
/home/ubuntu/xmrig/src/crypto/variant4_random_math.h:182:12: warning: ‘int v4_random_math_init(V4_Instruction, uint64_t)’ defined but not used [-Wunused-function]
static int v4_random_math_init(struct V4_Instruction code, const uint64_t height)
^~~~~~~
cc1plus: warning: unrecognized command line option ‘-Wno-class-memaccess’
[ 88%] Building CXX object CMakeFiles/xmrig.dir/src/common/log/SysLog.cpp.o
[ 90%] Building CXX object CMakeFiles/xmrig.dir/src/api/Api.cpp.o
[ 91%] Building CXX object CMakeFiles/xmrig.dir/src/api/ApiRouter.cpp.o
[ 93%] Building CXX object CMakeFiles/xmrig.dir/src/common/api/Httpd.cpp.o
[ 95%] Building CXX object CMakeFiles/xmrig.dir/src/common/api/HttpRequest.cpp.o
[ 96%] Building CXX object CMakeFiles/xmrig.dir/src/common/net/Tls.cpp.o
[ 98%] Building CXX object CMakeFiles/xmrig.dir/src/crypto/Asm.cpp.o
[100%] Linking CXX executable xmrig
[100%] Built target xmrig
ubuntu@ubuntu:~/xmrig/build$
It's just a warning, you have xmrig binary at this point.
cnv2
[2019-01-12 16:11:47] speed 10s/60s/15m 316.8 315.3 314.9 H/s max 325.5 H/s | THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s | | 0 | -1 | 78.7 | 78.4 | 78.3 | | 1 | -1 | 79.6 | 79.3 | 79.2 | | 2 | -1 | 78.8 | 78.5 | 78.4 | | 3 | -1 | 79.5 | 79.2 | 79.1 |
cnr
[2019-01-12 15:50:17] speed 10s/60s/15m 342.3 342.4 338.7 H/s max 342.7 H/s | THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s | | 0 | -1 | 86.3 | 86.3 | 85.3 | | 1 | -1 | 84.9 | 85.0 | 84.0 | | 2 | -1 | 86.3 | 86.3 | 85.3 | | 3 | -1 | 84.9 | 84.9 | 84.0 |
model name : Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz
yeah, turns out i was running xmr-stak for some GPUs. must have been using enough of something on the CPU to run the GPUs.
Ok here are my results
Intel® Core™ i5-5250U CPU @ 1.60GHz × 4 sudo sysctl -w vm.nr_hugepages=128
figures shown are max h/s...CN/r seems to be more stable and slightly faster, see details below... Computer was completely left alone during testing...not even mouse movements...trying to get an exact apples to apples comparison.
CN/r 64.3
CN/v2 63.3
sudo ./xmrig --print-time 1 --max-cpu-usage 100 --cpu-priority 4 -o testnet.wowne.ro:3333 -u 9zuKdp6MrQaBSw85QP2fFh8zznN8tPbpJQZhX6wyVGHb5TdJu4QYcPK7EtfhUtnYbVUCFkNM9sPfViN5HvmXxV9iEVJP2o8 -p x -k commit 5367300
[2019-01-12 16:51:11] speed 10s/60s/15m 64.1 64.1 63.6 H/s max 64.3 H/s [2019-01-12 16:51:12] accepted (50/0) diff 2160 (324 ms) [2019-01-12 16:51:12] speed 10s/60s/15m 64.1 63.9 63.6 H/s max 64.3 H/s [2019-01-12 16:51:13] speed 10s/60s/15m 64.0 63.8 63.6 H/s max 64.3 H/s [2019-01-12 16:51:14] speed 10s/60s/15m 64.1 64.1 63.6 H/s max 64.3 H/s [2019-01-12 16:51:14] new job from testnet.wowne.ro:3333 diff 2190 algo cn/r height 386 [2019-01-12 16:51:15] speed 10s/60s/15m 64.0 63.9 63.6 H/s max 64.3 H/s
sudo ./xmrig --print-time 1 --max-cpu-usage 100 --cpu-priority 4 -o pool.wowne.ro:3333 -u Wo4hunKKcZ9JhCrqpFMotQQAiJAiATuJLNAK2HrPLoNzK8hkqNbf9t8gmx6bzAQrXRMnWnoELoiD6GTv8guPBRwH1FpjujDKH -p x -k commit 2b0b71b
[2019-01-12 17:59:37] speed 10s/60s/15m 61.7 60.2 60.1 H/s max 63.3 H/s [2019-01-12 17:59:37] new job from pool.wowne.ro:3333 diff 1740 algo cn/2 [2019-01-12 17:59:38] speed 10s/60s/15m 61.8 60.1 60.1 H/s max 63.3 H/s [2019-01-12 17:59:39] speed 10s/60s/15m 61.7 60.1 60.1 H/s max 63.3 H/s [2019-01-12 17:59:40] speed 10s/60s/15m 61.5 60.1 60.1 H/s max 63.3 H/s [2019-01-12 17:59:40] accepted (52/0) diff 1740 (326 ms) [2019-01-12 17:59:41] speed 10s/60s/15m 61.5 60.2 60.1 H/s max 63.3 H/s [2019-01-12 17:59:42] speed 10s/60s/15m 61.3 60.2 60.1 H/s max 63.3 H/s
AMD Ryzen 3 2200G, Linux 4.20.0, 2 threads:
CN/2: speed 10s/60s/15m 127.3 116.3 116.2 H/s max 134.0 H/s
CN/R: speed 10s/60s/15m 126.7 127.2 122.2 H/s max 130.5 H/s
Dual Xeon: Linux Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz (2) x64 AES
CN/2
[2019-01-15 11:32:02] new job from xmrpool.eu:7777 diff 10000 algo cn/2 height 0
| THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s |
| 0 | 0 | 50.1 | 50.1 | n/a |
| 1 | 1 | 50.9 | 51.0 | n/a |
| 2 | 2 | 43.0 | 43.1 | n/a |
| 3 | 3 | 51.9 | 51.9 | n/a |
| 4 | 4 | 51.3 | 51.4 | n/a |
| 5 | 5 | 48.0 | 48.0 | n/a |
| 6 | 6 | 47.2 | 47.2 | n/a |
| 7 | 7 | 46.0 | 46.0 | n/a |
| 8 | 8 | 46.3 | 46.3 | n/a |
| 9 | 9 | 51.3 | 51.3 | n/a |
| 10 | 10 | 51.4 | 51.4 | n/a |
| 11 | 11 | 42.1 | 42.2 | n/a |
| 12 | 12 | 50.1 | 50.1 | n/a |
| 13 | 13 | 49.2 | 49.2 | n/a |
| 14 | 14 | 50.4 | 50.4 | n/a |
| 15 | 15 | 51.2 | 51.2 | n/a |
| 16 | 16 | 43.2 | 43.2 | n/a |
| 17 | 17 | 52.0 | 52.0 | n/a |
| 18 | 18 | 51.5 | 51.5 | n/a |
| 19 | 19 | 48.1 | 48.1 | n/a |
| 20 | 20 | 47.4 | 47.4 | n/a |
| 21 | 21 | 46.0 | 46.0 | n/a |
| 22 | 22 | 46.4 | 46.4 | n/a |
| 23 | 23 | 51.3 | 51.3 | n/a |
| 24 | 24 | 51.4 | 51.4 | n/a |
| 25 | 25 | 42.1 | 42.1 | n/a |
| 26 | 26 | 50.1 | 50.1 | n/a |
| 27 | 27 | 49.2 | 49.2 | n/a |
| 28 | 30 | 43.3 | 43.4 | n/a |
| 29 | 39 | 42.4 | 42.4 | n/a |
| 30 | 44 | 43.5 | 43.5 | n/a |
| 31 | 53 | 42.4 | 42.4 | n/a |
[2019-01-15 11:32:03] speed 10s/60s/15m 1530.6 1531.2 n/a H/s max 1531.6 H/s
CN/R
[2019-01-15 11:29:30] new job from testnet.wowne.ro:3333 diff 47610 algo cn/r height 641
| THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s |
| 0 | 0 | 49.6 | 49.5 | n/a |
| 1 | 1 | 50.5 | 50.4 | n/a |
| 2 | 2 | 35.2 | 35.2 | n/a |
| 3 | 3 | 51.4 | 51.3 | n/a |
| 4 | 4 | 50.9 | 50.7 | n/a |
| 5 | 5 | 47.5 | 47.4 | n/a |
| 6 | 6 | 46.7 | 46.6 | n/a |
| 7 | 7 | 45.5 | 45.4 | n/a |
| 8 | 8 | 45.8 | 45.7 | n/a |
| 9 | 9 | 50.8 | 50.7 | n/a |
| 10 | 10 | 50.9 | 50.8 | n/a |
| 11 | 11 | 34.6 | 34.7 | n/a |
| 12 | 12 | 49.6 | 49.5 | n/a |
| 13 | 13 | 48.8 | 48.7 | n/a |
| 14 | 14 | 49.9 | 49.8 | n/a |
| 15 | 15 | 50.6 | 50.5 | n/a |
| 16 | 16 | 35.3 | 35.4 | n/a |
| 17 | 17 | 51.4 | 51.4 | n/a |
| 18 | 18 | 50.9 | 50.9 | n/a |
| 19 | 19 | 47.5 | 47.4 | n/a |
| 20 | 20 | 46.8 | 46.7 | n/a |
| 21 | 21 | 45.5 | 45.4 | n/a |
| 22 | 22 | 45.8 | 45.8 | n/a |
| 23 | 23 | 50.7 | 50.6 | n/a |
| 24 | 24 | 50.8 | 50.8 | n/a |
| 25 | 25 | 34.6 | 34.6 | n/a |
| 26 | 26 | 49.5 | 49.5 | n/a |
| 27 | 27 | 48.7 | 48.7 | n/a |
| 28 | 30 | 35.2 | 35.3 | n/a |
| 29 | 39 | 34.6 | 34.7 | n/a |
| 30 | 44 | 35.3 | 35.4 | n/a |
| 31 | 53 | 34.6 | 34.6 | n/a |
[2019-01-15 11:29:44] speed 10s/60s/15m 1455.6 1454.3 n/a H/s max 1469.1 H/s
So approx 5% speed difference
CN/2: [2019-01-15 11:32:03] speed 10s/60s/15m 1530.6 1531.2 n/a H/s max 1531.6 H/s
CN/R: [2019-01-15 11:29:44] speed 10s/60s/15m 1455.6 1454.3 n/a H/s max 1469.1 H/s
@nssy Do you have any double threads in your config? Double hash code for cn/r is far from optimized yet. Can you test both algorithms only with single threads?
No double hash threads used.
@nssy Still, you have 32 threads on only 28 physical cores, so 4 cores run 2 single threads each. This configuration is much slower because single thread already uses 80% of a single CPU core compute capacity. I'll need to implement double thread using SSE instructions to fix this slowdown.
Ok I have run both with 28 threads instead.
Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz (2) x64 AES
CN/2
[2019-01-15 12:25:09] READY (CPU) threads 28(28) huge pages 28/28 100% memory 56.0 MB
....
[2019-01-15 12:28:46] new job from xmrpool.eu:7777 diff 240009 algo cn/2 height 0
| THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s |
| 0 | 0 | 50.7 | 50.6 | n/a |
| 1 | 1 | 51.4 | 51.4 | n/a |
| 2 | 2 | 52.1 | 52.1 | n/a |
| 3 | 3 | 52.3 | 52.3 | n/a |
| 4 | 4 | 51.8 | 51.8 | n/a |
| 5 | 5 | 48.4 | 48.4 | n/a |
| 6 | 6 | 47.7 | 47.7 | n/a |
| 7 | 7 | 46.4 | 46.4 | n/a |
| 8 | 8 | 46.8 | 46.8 | n/a |
| 9 | 9 | 51.7 | 51.7 | n/a |
| 10 | 10 | 51.8 | 51.8 | n/a |
| 11 | 11 | 50.7 | 50.7 | n/a |
| 12 | 12 | 50.5 | 50.5 | n/a |
| 13 | 13 | 49.7 | 49.7 | n/a |
| 14 | 14 | 50.8 | 50.8 | n/a |
| 15 | 15 | 51.5 | 51.5 | n/a |
| 16 | 16 | 52.1 | 52.1 | n/a |
| 17 | 17 | 52.4 | 52.4 | n/a |
| 18 | 18 | 51.9 | 51.9 | n/a |
| 19 | 19 | 48.4 | 48.4 | n/a |
| 20 | 20 | 47.7 | 47.7 | n/a |
| 21 | 21 | 46.4 | 46.4 | n/a |
| 22 | 22 | 46.8 | 46.8 | n/a |
| 23 | 23 | 51.7 | 51.7 | n/a |
| 24 | 24 | 51.8 | 51.8 | n/a |
| 25 | 25 | 50.7 | 50.7 | n/a |
| 26 | 26 | 50.5 | 50.5 | n/a |
| 27 | 27 | 49.7 | 49.7 | n/a |
[2019-01-15 12:28:49] speed 10s/60s/15m 1404.6 1404.4 n/a H/s max 1404.6 H/s
CN/R
[2019-01-15 12:29:16] READY (CPU) threads 28(28) huge pages 28/28 100% memory 56.0 MB
...
[2019-01-15 12:30:21] new job from testnet.wowne.ro:3333 diff 2340 algo cn/r height 653
[2019-01-15 12:30:22] accepted (2/0) diff 2340 (212 ms)
| THREAD | AFFINITY | 10s H/s | 60s H/s | 15m H/s |
| 0 | 0 | 50.2 | 50.2 | n/a |
| 1 | 1 | 51.0 | 51.0 | n/a |
| 2 | 2 | 51.7 | 51.7 | n/a |
| 3 | 3 | 51.8 | 51.9 | n/a |
| 4 | 4 | 51.4 | 51.4 | n/a |
| 5 | 5 | 47.9 | 47.9 | n/a |
| 6 | 6 | 47.2 | 47.2 | n/a |
| 7 | 7 | 45.9 | 45.9 | n/a |
| 8 | 8 | 46.2 | 46.2 | n/a |
| 9 | 9 | 51.2 | 51.2 | n/a |
| 10 | 10 | 51.4 | 51.3 | n/a |
| 11 | 11 | 50.3 | 50.3 | n/a |
| 12 | 12 | 50.1 | 50.1 | n/a |
| 13 | 13 | 49.2 | 49.2 | n/a |
| 14 | 14 | 50.2 | 50.2 | n/a |
| 15 | 15 | 51.0 | 51.0 | n/a |
| 16 | 16 | 51.7 | 51.7 | n/a |
| 17 | 17 | 51.9 | 51.9 | n/a |
| 18 | 18 | 51.4 | 51.4 | n/a |
| 19 | 19 | 47.9 | 47.9 | n/a |
| 20 | 20 | 47.2 | 47.2 | n/a |
| 21 | 21 | 45.9 | 45.9 | n/a |
| 22 | 22 | 46.2 | 46.2 | n/a |
| 23 | 23 | 51.2 | 51.2 | n/a |
| 24 | 24 | 51.4 | 51.4 | n/a |
| 25 | 25 | 50.3 | 50.3 | n/a |
| 26 | 26 | 50.1 | 50.1 | n/a |
| 27 | 27 | 49.2 | 49.2 | n/a |
[2019-01-15 12:30:22] speed 10s/60s/15m 1391.1 1391.0 n/a H/s max 1391.2 H/s
So approx 0.9% speed difference
CN/2: [2019-01-15 12:28:49] speed 10s/60s/15m 1404.6 1404.4 n/a H/s max 1404.6 H/s
CN/R: [2019-01-15 12:30:22] speed 10s/60s/15m 1391.1 1391.0 n/a H/s max 1391.2 H/s
I've tried Opteron 6276. With single thread I got
Single Thread: 53H/s
When invoking 24 Threads I got:
24 Threads: 760H/s
But 760:24=31,66H/s
Something is woring, I tried all settings I could but got no better performance.
Does someone has a solution?
Opteron 6276 has 8 core modules and 16 MB L2 cache, dual system has 16 modules and 32 MB L2 cache, but you run 24 threads - it's more than it can handle without dropping single thread performance.
Sorry, I did not mention that it is a dual board. 2 x Opteron 6276 inside a HP dl165 G7. In total 16 cores/32 threads.
More than 1 thread (or scratchpad) per core is not optimized yet. I'll be working on optimized double hash code this and next week, so you can expect some improvement when it's done.
Thanks
Intel(R) Xeon(R) CPU E5-2630 v4 @ 2.20GHz sudo sysctl -w vm.nr_hugepages=128
cn/R sudo ./xmrig --print-time 1 --av 1 --cpu-priority 4 -t 1 -o testnet.wowne.ro:3333 -u 9zuKdp6MrQaBSw85QP2fFh8zznN8tPbpJQZhX6wyVGHb5TdJu4QYcPK7EtfhUtnYbVUCFkNM9sPfViN5HvmXxV9iEVJP2o8 -p x -k [2019-01-20 20:39:10] accepted (55/0) diff 1000 (23 ms) [2019-01-20 20:39:10] new job from testnet.wowne.ro:3333 diff 1000 algo cn/r height 108 [2019-01-20 20:39:10] speed 10s/60s/15m 32.8 32.7 32.8 H/s max 35.1 H/s
cn/2 sudo ./xmrig --print-time 1 --av 1 --cpu-priority 4 -t 1 -o pool.wowne.ro:3333 -u Wo4fVqVg7prJTJ44hPmJK8e3bzBWL7gWVLnU8vSeg4NwUQvxuxLdB9xZY6PZHmhLxeCYPSxCYwcqkfa86Zf8gcFw2NRRyT3mN -p x -k [2019-01-20 20:37:54] speed 10s/60s/15m 37.0 36.8 36.2 H/s max 37.5 H/s [2019-01-20 20:37:55] new job from pool.wowne.ro:3333 diff 1110 algo cn/2 [2019-01-20 20:37:55] speed 10s/60s/15m 37.1 36.8 36.2 H/s max 37.5 H/s
[2019-01-20 20:38:46] speed 10s/60s/15m 37.0 36.9 36.3 H/s max 37.5 H/s [2019-01-20 20:38:47] accepted (63/0) diff 1110 (91 ms) [2019-01-20 20:38:47] speed 10s/60s/15m 36.7 36.8 36.3 H/s max 37.5 H/s
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz sudo sysctl -w vm.nr_hugepages=128
cn/R sudo ./xmrig --print-time 1 --av 1 --cpu-priority 4 -t 1 -o testnet.wowne.ro:3333 -u 9zuKdp6MrQaBSw85QP2fFh8zznN8tPbpJQZhX6wyVGHb5TdJu4QYcPK7EtfhUtnYbVUCFkNM9sPfViN5HvmXxV9iEVJP2o8 -p x -k [2019-01-21 05:09:40] new job from testnet.wowne.ro:3333 diff 1230 algo cn/r height 134
[2019-01-21 05:09:56] speed 10s/60s/15m 37.1 36.9 36.9 H/s max 37.7 H/s [2019-01-21 05:09:57] accepted (52/0) diff 1230 (88 ms) [2019-01-21 05:09:57] speed 10s/60s/15m 37.3 36.9 36.9 H/s max 37.7 H/s
cn/2 sudo ./xmrig --print-time 1 --av 1 --cpu-priority 4 -t 1 -o pool.wowne.ro:3333 -u Wo4fVqVg7prJTJ44hPmJK8e3bzBWL7gWVLnU8vSeg4NwUQvxuxLdB9xZY6PZHmhLxeCYPSxCYwcqkfa86Zf8gcFw2NRRyT3mN -p x -k [2019-01-21 05:58:58] new job from pool.wowne.ro:3333 diff 1140 algo cn/2
[2019-01-21 05:59:42] speed 10s/60s/15m 38.3 38.3 38.3 H/s max 38.9 H/s [2019-01-21 05:59:42] accepted (59/0) diff 1140 (80 ms) [2019-01-21 05:59:43] speed 10s/60s/15m 38.3 38.3 38.3 H/s max 38.9 H/s
Have been any progress on multithreading of the Opteron 6276?
Regards
Am Mo., 21. Jan. 2019, 07:01 hat Jorge notifications@github.com geschrieben:
Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz sudo sysctl -w vm.nr_hugepages=128
cn/R sudo ./xmrig --print-time 1 --av 1 --cpu-priority 4 -t 1 -o testnet.wowne.ro:3333 -u 9zuKdp6MrQaBSw85QP2fFh8zznN8tPbpJQZhX6wyVGHb5TdJu4QYcPK7EtfhUtnYbVUCFkNM9sPfViN5HvmXxV9iEVJP2o8 -p x -k [2019-01-21 05:09:40] new job from testnet.wowne.ro:3333 diff 1230 algo cn/r height 134
[2019-01-21 05:09:56] speed 10s/60s/15m 37.1 36.9 36.9 H/s max 37.7 H/s [2019-01-21 05:09:57] accepted (52/0) diff 1230 (88 ms) [2019-01-21 05:09:57] speed 10s/60s/15m 37.3 36.9 36.9 H/s max 37.7 H/s
cn/2 sudo ./xmrig --print-time 1 --av 1 --cpu-priority 4 -t 1 -o pool.wowne.ro:3333 -u Wo4fVqVg7prJTJ44hPmJK8e3bzBWL7gWVLnU8vSeg4NwUQvxuxLdB9xZY6PZHmhLxeCYPSxCYwcqkfa86Zf8gcFw2NRRyT3mN -p x -k [2019-01-21 05:58:58] new job from pool.wowne.ro:3333 diff 1140 algo cn/2
[2019-01-21 05:59:42] speed 10s/60s/15m 38.3 38.3 38.3 H/s max 38.9 H/s [2019-01-21 05:59:42] accepted (59/0) diff 1140 (80 ms) [2019-01-21 05:59:43] speed 10s/60s/15m 38.3 38.3 38.3 H/s max 38.9 H/s
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/SChernykh/CryptonightR/issues/5#issuecomment-455957498, or mute the thread https://github.com/notifications/unsubscribe-auth/AQUJAajfFGiVxKoMp35XcWzXqVPbI9xyks5vFVeggaJpZM4Z8UI3 .
@gvgit My first attempt to make fast double hash version using SSE failed - it wasn't faster than current code, so I have to think again.
@gvgit I've tried different approach for double hash SSE version and it seems to work, at least on Intel processors. I'll keep experimenting, but chances are good that double hash performance will be improved.
I am just about to test some more CPU's and post results.
What is "cn/half" algo ?
What is "cn/half" algo ?
It's not related to this testing. It's CNv2 with half iterations used in some recent coin forks.
Any success on double hash SSE regarding AMD CPUs?
The results are mixed. As soon as I fixed all bugs in the code, it wasn't faster on Intel and was a bit faster on Ryzen (but Ryzen doesn't need it since two single threads on the same core are still faster). I haven't tested it on older AMD processors yet. Double hash SSE version (test code) is available in https://github.com/SChernykh/CryptonightR - but it requires Windows + Visual Studio to compile and run.
New success on Opterons?
@gvgit Opterons are already faster with CryptonightR than they were with CNv2. I'm not sure SSE would help them because they have only one FP unit per 2 cores, but each core has integer unit. I still need to implement SSE code in xmrig to test it thoroughly.
@SChernykh CNv2 was a knockout for Opterons. If the hashrate won't get back to 1kh/s for a dual opteron system it will likely be the end for Opterons.
[ send by mobile device ]
Am Mo., 4. Feb. 2019, 18:42 hat SChernykh notifications@github.com geschrieben:
@gvgit https://github.com/gvgit Opterons are already faster with CryptonightR than they were with CNv2. I'm not sure SSE would help them because they have only one FP unit per 2 cores, but each core has integer unit. I still need to implement SSE code in xmrig to test it thoroughly.
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/SChernykh/CryptonightR/issues/5#issuecomment-460341616, or mute the thread https://github.com/notifications/unsubscribe-auth/AQUJARGBGorK7H66PHxZaJy4SHlcXhMuks5vKHD2gaJpZM4Z8UI3 .
I have compiled XMRig on Ubuntu 16.04 successfully with these commands:
git clone https://github.com/SChernykh/xmrig.git cd xmrig git checkout CryptonightR mkdir build cd build cmake .. make
I now have a xmrig file in the build directory but have no idea how to run it for testing?
The only instructions for configuration is this text:
Usage
Use config.xmrig.com to generate, edit or share configurations.
That seems to reference a Windows program. So again please help me to get this running for testing on my HP DL580 and AMD R815 servers.
Any News about Opterons 6276?
[ send by mobile device ]
Am Di., 12. Feb. 2019, 20:20 hat kio3i0j9024vkoenio < notifications@github.com> geschrieben:
I have compiled XMRig on Ubuntu 16.04 successfully with these commands:
git clone https://github.com/SChernykh/xmrig.git cd xmrig git checkout CryptonightR mkdir build cd build cmake .. make
I now have a xmrig file in the build directory but have no idea how to run it for testing?
The only instructions for configuration is this text:
Usage
Use config.xmrig.com to generate, edit or share configurations.
That seems to reference a Windows program. So again please help me to get this running for testing on my HP DL580 and AMD R815 servers.
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/SChernykh/CryptonightR/issues/5#issuecomment-462896920, or mute the thread https://github.com/notifications/unsubscribe-auth/AQUJAV1i4YIJR774zTFX-chaX3UlL06hks5vMxQFgaJpZM4Z8UI3 .
@kio3i0j9024vkoenio
For CN/R
./xmrig --max-cpu-usage 100 -o testnet.wowne.ro:3333 -u 9zuKdp6MrQaBSw85QP2fFh8zznN8tPbpJQZhX6wyVGHb5TdJu4QYcPK7EtfhUtnYbVUCFkNM9sPfViN5HvmXxV9iEVJP2o8 -p x -k
You will need to configure threads accordingly.
Ryzen 7 1700 around 450H/S with 8 threads on latest xmrig (cn/wow)
@SChernykh, I don't know how many ARMv8 miners are out there, but just to let you know... it seems that CN/R has a big impact on ARMv8
Using xmrig 2.13.1 built with gcc/7.3.0 on a NanoPi M3 (Samsung S5P6818 Octa-Core Cortex-A53) these are my results:
variant 0: 28.6 H/s variant 1: 28.4 H/s variant 2: 27.2 H/s variant R: 14.9 H/s
ARM code doesn't have JIT code generator, I didn't think someone would actually use ARM processors for mining.
I don't mine on it, I was just testing it for fun. Maybe there are some people out there actually mining on them. I just thought I would post my results in case you weren't aware of the performance hit. Thanks for all your work btw :)
System is a HP DL580-G7 Server with four Xeon E7-8837 processors running Ubuntu 16.04
xmr-stak 2.10.0 56d2770
32 threads, 8 threads per processor, on each processor four double threads and four single threads
V8 - 1500.6 H/s R - 1498.8 H/s
So exactly the same for cryptonight_v8 and cryptonight_r
ARM definitely takes a major hit - xmrig no longer builds for iphone 6s and later (arm64 / aarch64) starting with v2.13.0, so I changed some of the build files to get a working binary. The new binaries run, but with about 75% reduced hashrate - i.e., from ~58.9 H/s to ~14.7 H/s!!
On RPi 3B running 64bit Ubuntu, the hashrate reduced from ~7.3 H/s to ~3.0 H/s.
So, an all around major reduction in ARM devices.
Any progress on Dual-Opteron 6276? CN/0 and CN/1 were around 1050kH/s. CN/2 was a knockout with 740kH/s. Will CN/R get it back to >1000kH/s?
@resistor4u ARM doesn't have JIT compiler for cn/r, this is why it's slower now. But it's possible to implement. @gvgit Most likely not, but cn/r was a bit faster than cn/2 last time I tested it.
Test results will be collected here. Please post your hashrate and power consumption (if you can measure it) compared to latest xmrig/xmrig-amd release on CryptonightV2.
1-2% hashrate drop is normal because CPU/GPU miner code is not fully optimized yet. In case there is a significant hashrate drop on some hardware, we have an option to reduce random math size appropriately.
Basic algorithm description:
Optimized CPU miner:
Optimized GPU miner:
Test pools: