Open George-Gate opened 7 years ago
I also try the CUDA_Tromp miner with CUDA7.5 on my server, I'm so amazing that only 17 Sols/s with Tesla K80 ...... Is it normal?
Me too.
I use 1080 but only 26sols / s is measured.
I think I'm going crazy.
So I went through the CUDA version in README.md and now I get a cuda error.
Never change the CUDA version.
be careful.
If you are curious, check out the issue I just uploaded.
Could not make it work. 26 Sol/s seems like only the CPU is working
[12:05:05][0x00007f76aea8b700] miner#0 | Starting thread #0 (CPU-XENONCAT-AVX2)
[12:05:05][0x00007f76ada89700] miner#2 | Starting thread #2 (CPU-XENONCAT-AVX2)
[12:05:05][0x00007f76ad288700] miner#3 | Starting thread #3 (CPU-XENONCAT-AVX2)
[12:05:05][0x00007f76ae28a700] miner#1 | Starting thread #1 (CPU-XENONCAT-AVX2)
[12:05:05][0x00007f76aca87700] miner#4 | Starting thread #4 (CPU-XENONCAT-AVX2)
[12:05:05][0x00007f76af28c700] stratum | Connecting to stratum server equihash.jp.nicehash.com:3357
[12:05:05][0x00007f7697fff700] miner#5 | Starting thread #5 (CPU-XENONCAT-AVX2)
[12:05:05][0x00007f76977fe700] miner#6 | Starting thread #6 (cuda_djezo_STUB)
I'm running on the CPU
It's actually possible to compile and run djezo with CUDA 7.5 (managed that with 7.0).
The error @George-Gate is getting is because a new intrinsic load appeared in CUDA 8.0: __ldca
. As suggested on the developer forum
Another developer suggested an implementation of cache load that looks roughly equivalent to the missing __ldca
.
From that, I've added the following function to djezo's source code
__device__ int loadThroughL1Cache(int* p)
{
int out;
asm("ld.global.ca.s32 %0, [%1];" : "=r"(out) : "l"(p));
return out;
}
Compiles fine but runs slow.. getting around 15-20 sol/s on a GTX 780.
My server only have CUDA 7.5 runtime library, so I want to know if I can compile the new cuda miner with CUDA 7.5. I have tried it but the compiler gives errors: