qubic-li / client


GPU mining crash #86

Closed by StraySnake 3 months ago

StraySnake commented 3 months ago

System: HiveOS 0.6-222@230511
Nvidia drivers: 525.116.04
Qubic client: HiveOS x64 1.8.10

For the past 2 days my miner has been rebooting roughly every 6 hours, and a fresh install gives the same result.

I have two types of crash logs:

[Sat Apr 13 11:09:00 2024] NVRM: Xid (PCI:0000:01:00): 62, pid='', name=, badfbadf(badfbadf) 00000000 00000000
[Sat Apr 13 11:09:00 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000010
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000011
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000012
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000013
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000014
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000015
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000016
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000017
[Fri Apr 12 22:16:59 2024] NVRM: Xid (PCI:0000:01:00): 13, pid='', name=, Graphics SM Warp Exception on (GPC 7, TPC 4, SM 0): Out Of Range Address
[Fri Apr 12 22:16:59 2024] NVRM: Xid (PCI:0000:01:00): 13, pid='', name=, Graphics Exception: ESR 0x53e730=0xc02000e 0x53e734=0x20 0x53e728=0xf81eb60 0x53e72c=0x1174
[Fri Apr 12 22:26:31 2024] NVRM: Xid (PCI:0000:01:00): 43, pid=6454, name=qli-runner, Ch 00000010
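
For anyone triaging similar logs, the Xid events above can be tallied with a short script. This is only a sketch: the function name `summarize_xids`, the `SAMPLE` excerpt, and the regular expression are my own assumptions based on the line format shown in this issue, and the short descriptions are paraphrased from NVIDIA's public Xid documentation, so check them against the current docs.

```python
import re
from collections import Counter

# Short descriptions paraphrased from NVIDIA's Xid documentation (verify there).
XID_MEANINGS = {
    13: "Graphics Engine Exception",
    43: "GPU stopped processing",
    45: "Preemptive cleanup, due to previous errors",
    62: "Internal micro-controller halt",
}

# Matches the "NVRM: Xid (PCI:....): <code>," part of a kernel log line.
# Assumption: the format is the one seen in the logs in this issue.
XID_RE = re.compile(r"NVRM: Xid \((?P<pci>PCI:[0-9a-fA-F:.]+)\): (?P<xid>\d+),")


def summarize_xids(log_text):
    """Count Xid events per (PCI address, Xid code) in kernel log text."""
    counts = Counter()
    for line in log_text.splitlines():
        match = XID_RE.search(line)
        if match:
            counts[(match.group("pci"), int(match.group("xid")))] += 1
    return counts


# Three lines copied from the first crash log in this issue.
SAMPLE = """\
[Sat Apr 13 11:09:00 2024] NVRM: Xid (PCI:0000:01:00): 62, pid='', name=, badfbadf(badfbadf) 00000000 00000000
[Sat Apr 13 11:09:00 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000010
[Sat Apr 13 11:09:05 2024] NVRM: Xid (PCI:0000:01:00): 45, pid=5153, name=qli-runner, Ch 00000011
"""

if __name__ == "__main__":
    for (pci, xid), n in sorted(summarize_xids(SAMPLE).items()):
        meaning = XID_MEANINGS.get(xid, "see NVIDIA Xid documentation")
        print(f"{pci}  Xid {xid:>3}  x{n}  {meaning}")
```

On a live rig the same function could be fed the output of `dmesg` instead of `SAMPLE`. An Xid 13 or 62 followed by a burst of Xid 45 on a single GPU, as in the logs above, points at that card rather than at the miner process alone.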

Thanks

StraySnake commented 3 months ago

I've found the solution. Since epoch 104, the overclock on my RTX 4090 has been crashing the driver.

Old: nvtool --setcoreoffset 200 --setclocks 2900 --setmem 7000 --setmemoffset 2000
New: nvtool --setcoreoffset 200 --setclocks 1900 --setmem 7000 --setmemoffset 2000

(I think I can overclock a little more.)