Closed AlexGuo1998 closed 1 year ago
Similar issue here, but with default bucket size.
Bladebit Chia Plotter Version : 2.0.0 Git Commit : d64791880af89edebb6f1126c953d4d98b8007db Compiled With: msvc 19.29.30146
[Global Plotting Config] Will create 1 plots. Thread count : 16 Warm start enabled : false NUMA disabled : false CPU affinity disabled : false Farmer public key : xxx Pool contract address : xxx Output path : C:\Temp\
[Bladebit Disk Plotter] Heap size : 3.37 GiB ( 3447.82 MiB ) Cache size : 4.00 GiB ( 4096.00 MiB ) Bucket count : 256 Alternating I/O: false F1 threads : 16 FP threads : 16 C threads : 16 P2 threads : 16 P3 threads : 16 I/O threads : 1 Temp1 block sz : 4096 Temp2 block sz : 4096 Temp1 path : D:\ Temp2 path : D:\ I/O metrices enabled. Allocating memory WARNING: Forcing warm start for testing. Warm start: Pre-faulting memory pages... Memory initialized.
............ ............ ............
Table 7 Sorting : Completed in 25.13 seconds. Distribution : Completed in 2.45 seconds. Matching : Completed in 22.31 seconds. Fx : Completed in 25.97 seconds. Completed table 7 in 86.47 seconds with 4290248567 entries. Table 7 I/O wait time: 0.09 seconds. Table 7 I/O Metrics: Average read throughput 1298.40 MiB ( 1361.47 MB ) or 1.27 GiB ( 1.36 GB ). Total size read: 66265.89 MiB ( 69484.82 MB ) or 64.71 GiB ( 69.48 GB ). Total read commands: 196608. Average write throughput 2369.90 MiB ( 2485.02 MB ) or 2.31 GiB ( 2.49 GB ). Total size written: 79287.82 MiB ( 83139.31 MB ) or 77.43 GiB ( 83.14 GB ). Total write commands: 66306.
Sorting F7 & Writing C Tables
pause Press any key to continue . . .
Bladebit just stops during "Sorting F7 & Writing C Tables"
Tried cache sizes of 4G, 16G, 20G and 24G, same issue each time.
Fixed now, updated to 2.0.1
It is not fixed, you can see from the logs posted, version is 2.0.1
I am facing the same problem, default bucket size. It is random problem and not always applicable.
I just meant updating to 2.0.1 fixed the problem I was having.
It is not fixed, you can see from the logs posted, version is 2.0.1
It is fixed, whatever problem here, if a bug, is not in any way related to the previous issue in 2.0.0
The initial question here would be if the temp directory has enough space as using 512 buckets will require more because of alignment requirements
My setup is: Temp1 dedicated NVMe 1TB 50GB --cache (total of 64GB DDR4) 256 Bucket Size I will say temp has sufficient space in my case.
Running with -n 12 sometimes stops on 3rd plot, sometimes on 7th, sometimes will complete all 12, it is totally random problem for me. In last 24h it happen 3 times. Version is 2.0.1
The initial question here would be if the temp directory has enough space as using 512 buckets will require more because of alignment requirements
I think 580GB (540GiB) must be enough? Will check with more free space later.
Faulting application name: bladebit.exe, version: 0.0.0.0, time stamp: 0x636978c6 Faulting module name: bladebit.exe, version: 0.0.0.0, time stamp: 0x636978c6 Exception code: 0xc0000005 Fault offset: 0x0000000000146af0 Faulting process id: 0x35c0 Faulting application start time: 0x01d8f748e64ba8d6 Faulting application path: C:\BB\bladebit.exe Faulting module path: C:\BB\bladebit.exe Report Id: 518946e3-0a08-4fc1-8f37-325f9d8e95b7 Faulting package full name: Faulting package-relative application ID:
After digging into the code I assume this is a bug.
Length of _mapBitCounts
should be _numBuckets+ExtraBucket
i.e. _numBuckets
, 512. However it's defined as:
Oops.
Anything more than 256 buckets would crash, because uint32 _threadCount
get overridden to an arbitrary large number, making jobs[i]
pointing to an invalid address.
Firing a PR...
Thanks for taking the time to dig into this!
I'll try to have a look by tomorrow
BTW, I can confirm it works with my local build. No more crashes, and chia plots check
passed.
Here are the CI artifacts if you want to try yourself: https://github.com/AlexGuo1998/bladebit/actions/runs/3476962769#artifacts (x86-64 only, use at your own risk!)
Indeed I checked out your code review and you are correct. Well done! Bringing the convo over to the PR.
Fixed in #251
As mentioned in https://github.com/Chia-Network/bladebit/issues/241#issuecomment-1309212321, creating a new issue about this.
Bladebit v2.0.1 diskplot crashed with
-b 512
, while-b 256
works fine.Command used: (keys masked with
...
)where
TEMP
andOUT
are two seperatentfs
partitions (in HDDs).Logs:
Other information:
(Any other information needed?)
I'm able to build a debug build to collect core dumps if needed, however, my configuration is quite slow.