Chia-Network / bladebit

A high-performance k32-only, Chia (XCH) plotter supporting in-RAM and disk-based plotting
Apache License 2.0
336 stars 109 forks source link

*** Panic!!! *** Fatal Error: CUDA error cudaErrorIllegalAddress : an illegal memory access was encountered. #290

Open yxz215 opened 1 year ago

yxz215 commented 1 year ago

image

Bladebit Chia Plotter Version : 3.0.0-alpha1 Git Commit : f269db0a7ad307514e993c335897cea7ebf46eda Compiled With: gcc 9.4.0

[Global Plotting Config] Will create 1 plots. Thread count : 48 Warm start enabled : false NUMA disabled : false CPU affinity disabled : false Farmer public key : a268eb2fd1de20fbb6ec19ac9350557873bf2c78622c1a8a546524da59df4c4f15b0d59b0f6497ee2864f3fa17c0c4f1 Pool contract address : xch199vfdmwrmzyxg4938r2tyfvler4q20v77r45w2rcurkefhlwypjq0sgh27 Compression Level : 7 Benchmark mode : disabled

[Bladebit CUDA Plotter] Selected cuda device 0 : NVIDIA GeForce RTX 3060 Ti CUDA Compute Capability : 8.6 SM count : 38 Max blocks per SM : 16 Max threads per SM : 1536 Async Engine Count : 2 L2 cache size : 3.00 MB L2 persist cache max size : 2.25 MB Stack Size : 1.00 KB Memory: Total : 7.78 GB Free : 7.65 GB

Allocating buffers (this may take a few seconds)... Kernel RAM required : 90240524288 bytes ( 86060.07 MiB or 84.04 GiB ) Intermediate RAM required : 2999001088 bytes ( 2860.07 MiB or 2.79 GiB ) Host RAM required : 141733920768 bytes ( 135168.00 MiB or 132.00 GiB ) Total Host RAM required : 231974445056 bytes ( 221228.07 MiB or 216.04 GiB ) GPU RAM required : 6135128064 bytes ( 5850.91 MiB or 5.71 GiB ) Allocating buffers

Generating plot 1 / 1: 335423bef4f54eb5e858f34c0c5e19e0fca4c630126bf7f71d8aea96c36e5065 Plot temporary file: /mnt/dat0/plot-k32-c07-2023-02-17-01-43-335423bef4f54eb5e858f34c0c5e19e0fca4c630126bf7f71d8aea96c36e5065.plot.tmp

Generating F1 Finished F1 in 1.50 seconds. CUDA error: 700 (0x2bc) cudaErrorIllegalAddress : an illegal memory access was encountered

Panic!!! Fatal Error:
CUDA error cudaErrorIllegalAddress : an illegal memory access was encountered. ./bladebit_cuda(+0xcf8cb)[0x55aaa95458cb] ./bladebit_cuda(+0xcf0af)[0x55aaa95450af] ./bladebit_cuda(+0x5217a)[0x55aaa94c817a] ./bladebit_cuda(+0x19aa0)[0x55aaa948faa0] ./bladebit_cuda(+0x1dcfb)[0x55aaa9493cfb] ./bladebit_cuda(+0x18245)[0x55aaa948e245] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3)[0x7f0941e06083] ./bladebit_cuda(+0x1974e)[0x55aaa948f74e]

Message from syslogd@tw at Feb 17 01:44:31 ... kernel:[46574.443888] watchdog: BUG: soft lockup - CPU#17 stuck for 22s! [bladebit_cuda:4360]

Is there a solution to this problem? ubuntu version is 20.04 amd 3960x 256gb ram 3060TI gpu 2 tb nvme 1676598803

brianwfreeman commented 1 year ago

I encountered the same issue and it made my system lock up with blank screen.

junjunwang2016 commented 1 year ago

I encountered the same issue

he426100 commented 1 year ago

just upgrade the kernel

apt install linux-headers-5.15.0-69-generic linux-image-5.15.0-69-generic
Perk-Mew commented 9 months ago

I encountered the same issue but in windows 11 how could i fix it?