trexminer / T-Rex

T-Rex NVIDIA GPU miner with web control monitoring page

TREX: Can't find nonce with device cuda exception: CUDA_ERROR_UNKNOWN #1376

Open traderdude123 opened 2 years ago

traderdude123 commented 2 years ago

Hi,

I keep getting this error after 24 hours of running.

TREX: Can't find nonce with device [ID=4, GPU #4], cuda exception: CUDA_ERROR_UNKNOWN
20220722 17:57:17 WARN: Miner is going to shutdown...
20220722 17:57:17 Main loop finished. Cleaning up resources...
20220722 17:57:17 ApiServer: stopped listening on 0.0.0.0:4067
20220722 17:57:18 WARN: NVML: can't get temperature for GPU #0, error code 999
20220722 17:57:18 WARN: NVML: can't get fan speed for GPU #0, error code 999
20220722 17:57:18 WARN: NVML: can't get power for GPU #0, error code 999
20220722 17:57:18 WARN: NVML: can't get mem/core clock for GPU #0, error code 999
20220722 17:57:21 T-Rex finished.
20220722 17:57:22 WARN: WATCHDOG: T-Rex does not exist anymore, restarting...
20220722 17:57:23 T-Rex NVIDIA GPU miner v0.24.8 - [Windows]

jesseharding24 commented 2 years ago

Have you tried increasing virtual memory? In the Windows search bar, type "Adjust the appearance and performance of Windows". Then go to the 'Advanced' tab and, under 'Virtual memory', click 'Change'.

I believe a good rule of thumb is to allocate 4GB of virtual memory per GPU installed in your system.

Hope this helps
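
For anyone who wants to turn that rule of thumb into a number, here is a minimal sketch. The 4 GB per GPU figure is only the rule of thumb quoted above, not an official T-Rex requirement, and the function name and the 7-GPU example are made up for illustration.

```python
# Rule of thumb quoted in this thread (an assumption, not an official T-Rex figure).
GB_PER_GPU = 4


def suggested_virtual_memory_gb(gpu_count: int, gb_per_gpu: int = GB_PER_GPU) -> int:
    """Return the suggested page file size in GB for a rig with gpu_count GPUs."""
    if gpu_count < 0:
        raise ValueError("gpu_count must not be negative")
    return gpu_count * gb_per_gpu


if __name__ == "__main__":
    gpus = 7  # hypothetical rig size, used only as an example
    size_gb = suggested_virtual_memory_gb(gpus)
    # The Windows virtual memory dialog takes custom sizes in MB.
    print(f"{gpus} GPUs: {size_gb} GB ({size_gb * 1024} MB)")
```

Treat the result as a starting point; the right value for a given rig may well be different.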

MedMall commented 2 years ago

I have the same problem. I think it's because my laptop's video card has 6 GB of video memory. P.S. The error I get is:

TREX: Can't find nonce with device [ID=0, GPU #0], cuda exception: CUDA_ERROR_OUT_OF_MEMORY

traderdude123 commented 2 years ago

I have allocated sufficient virtual memory. I have a total of 7 cards (5x 3090 + 2x 3070) and a dedicated 250 GB SSD. It happens after a full day's run. But now I'm seeing a pattern: it's always GPU #4 (a 3090) that causes this issue. What exactly is wrong with that card is still unknown.
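
One way to confirm a pattern like this is to tally the "Can't find nonce" errors per device across saved logs. The following is a rough sketch, assuming the error lines look exactly like the ones posted in this thread. The regex, the function name, and the sample text are illustrative and are not part of T-Rex.

```python
import re
from collections import Counter

# Matches error lines in the form seen in this thread, e.g.
# "TREX: Can't find nonce with device [ID=4, GPU #4], cuda exception: CUDA_ERROR_UNKNOWN"
# Both the straight and the curly apostrophe are accepted.
ERROR_RE = re.compile(
    r"Can['\u2019]t find nonce with device \[ID=(\d+), GPU #(\d+)\], "
    r"cuda exception: (\w+)"
)


def count_gpu_errors(log_text: str) -> Counter:
    """Count errors per (device ID, CUDA error name) found in log_text."""
    counts = Counter()
    for match in ERROR_RE.finditer(log_text):
        device_id, _gpu_number, error_name = match.groups()
        counts[(int(device_id), error_name)] += 1
    return counts


# Sample text copied from the log lines posted above.
SAMPLE = (
    "TREX: Can't find nonce with device [ID=4, GPU #4], "
    "cuda exception: CUDA_ERROR_UNKNOWN\n"
    "20220722 17:57:17 WARN: Miner is going to shutdown...\n"
)

if __name__ == "__main__":
    for (device_id, error_name), n in count_gpu_errors(SAMPLE).most_common():
        print(f"device ID={device_id}: {error_name} x{n}")
```

If one device ID dominates the tally over several days of logs, that points at the card itself (or its riser, power, or overclock settings) rather than at a system-wide setting such as virtual memory.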