Chia-Network / bladebit

A high-performance k32-only, Chia (XCH) plotter supporting in-RAM and disk-based plotting
Apache License 2.0

3.1.0-rc2 - Twice as slow to generate a plot with an option to check at the end. #421

Open kkaloto opened 9 months ago

kkaloto commented 9 months ago

2 x Xeon E5-2697 v2, 256 GB DDR3 ECC RAM, 2 TB NVMe, Nvidia Quadro P4000

.\bladebit_cuda -f 1053966a134c623c0be8d1974500455c0d952bdb1073fe2fc6e9df94444406c0211d3f79666ce79f702b28e9d8286aca -c xch149q6r4ytaczn37tm58heuulxgceztr8klnmakjzxxgtqy3nu3kdszqu0st -n 1 --compress 7 cudaplot --check 100 --check-threshold 0.95 D:\

Bladebit Chia Plotter
Version : 3.1.0-rc2
Git Commit : 31eba697164efeb29532805b74df00f4ffadcf60
Compiled With: msvc 19.29.30152

[Global Plotting Config]
Will create 1 plots.
Thread count : 48
Warm start enabled : false
NUMA disabled : false
CPU affinity disabled : false
Farmer public key : 1053966a134c623c0be8d1974500455c0d952bdb1073fe2fc6e9df94444406c0211d3f79666ce79f702b28e9d8286aca
Pool contract address : xch149q6r4ytaczn37tm58heuulxgceztr8klnmakjzxxgtqy3nu3kdszqu0st
Compression Level : 7
Benchmark mode : disabled

[Bladebit CUDA Plotter]
Host RAM : 255 GiB
Plot checks : enabled ( 100 )
Plot check threshold: 0.950

Selected cuda device 0 : Quadro P4000
CUDA Compute Capability : 6.1
SM count : 14
Max blocks per SM : 32
Max threads per SM : 2048
Async Engine Count : 1
L2 cache size : 2.00 MB
L2 persist cache max size : 0.00 MB
Stack Size : 1.00 KB
Memory:
Total : 8.00 GB
Free : 7.06 GB

Allocating buffers (this may take a few seconds)...
Kernel RAM required : 91955994624 bytes ( 87696.07 MiB or 85.64 GiB )
Intermediate RAM required : 4378927104 bytes ( 4176.07 MiB or 4.08 GiB )
Host RAM required : 142270791680 bytes ( 135680.00 MiB or 132.50 GiB )
Total Host RAM required : 234226786304 bytes ( 223376.07 MiB or 218.14 GiB )
GPU RAM required : 6163050496 bytes ( 5877.54 MiB or 5.74 GiB )
Allocating buffers... Done.

Generating plot 1 / 1: 0fd16652f305eb16ab8133f0cbdb939d9e62705f77dd32974785cc8e9b627f48
Plot temporary file: D:\plot-k32-c07-2023-10-01-01-34-0fd16652f305eb16ab8133f0cbdb939d9e62705f77dd32974785cc8e9b627f48.plot.tmp

Generating F1
Finished F1 in 26.73 seconds.
Table 2 completed in 91.29 seconds with 4294967296 entries.
Table 3 completed in 149.37 seconds with 4294967296 entries.
Table 4 completed in 223.12 seconds with 4294807525 entries.
Table 5 completed in 209.37 seconds with 4294629658 entries.
Table 6 completed in 179.60 seconds with 4294301621 entries.
Table 7 completed in 136.25 seconds with 4293525319 entries.
Finalizing Table 7
Finalized Table 7 in 56.31 seconds.
Completed Phase 1 in 1074.90 seconds
Marked Table 6 in 19.00 seconds.
Marked Table 5 in 20.27 seconds.
Marked Table 4 in 20.29 seconds.
Marked Table 3 in 20.40 seconds.
Completed Phase 2 in 79.97 seconds
Compressing Table 2 and 3...
Step 1 completed step in 64.44 seconds.
Step 2 completed step in 59.84 seconds.
Completed table 2 in 124.28 seconds with 3439805344 / 4294967296 entries ( 80.09% ).
Compressing tables 3 and 4...
Step 1 completed step in 50.47 seconds.
Step 2 completed step in 81.31 seconds.
Step 3 completed step in 65.76 seconds.
Completed table 3 in 197.55 seconds with 3465878993 / 4294807525 entries ( 80.70% ).
Compressing tables 4 and 5...
Step 1 completed step in 48.02 seconds.
Step 2 completed step in 75.52 seconds.
Step 3 completed step in 64.94 seconds.
Completed table 4 in 188.49 seconds with 3532552082 / 4294629658 entries ( 82.26% ).
Compressing tables 5 and 6...
Step 1 completed step in 47.78 seconds.
Step 2 completed step in 80.11 seconds.
Step 3 completed step in 69.98 seconds.
Completed table 5 in 197.87 seconds with 3712906184 / 4294301621 entries ( 86.46% ).
Compressing tables 6 and 7...
Step 1 completed step in 48.28 seconds.
Step 2 completed step in 82.13 seconds.
Step 3 completed step in 78.20 seconds.
Completed table 6 in 208.61 seconds with 4293525319 / 4293525319 entries ( 100.00% ).
Serializing P7 entries
Completed serializing P7 entries in 40.00 seconds.
Completed Phase 3 in 956.82 seconds
Completed Plot 1 in 2111.69 seconds ( 35.19 minutes )
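For anyone comparing this run against a log from another build, the per-phase totals can be extracted with a short script. This is only a minimal sketch that assumes the "Completed Phase N in X seconds" wording shown in the log above; the function name `phase_times` is hypothetical and not part of bladebit.

```python
import re

def phase_times(log: str) -> dict:
    """Map phase number to seconds, based on 'Completed Phase N in X seconds' entries."""
    pattern = re.compile(r"Completed Phase (\d+) in ([\d.]+) seconds")
    return {int(n): float(s) for n, s in pattern.findall(log)}

# Phase summary lines copied from the log in this report.
sample = (
    "Completed Phase 1 in 1074.90 seconds\n"
    "Completed Phase 2 in 79.97 seconds\n"
    "Completed Phase 3 in 956.82 seconds\n"
    "Completed Plot 1 in 2111.69 seconds ( 35.19 minutes )\n"
)

times = phase_times(sample)
total = sum(times.values())

# Print each phase with its share of the summed phase time.
for phase, secs in sorted(times.items()):
    print(f"Phase {phase}: {secs:8.2f} s ({secs / total:5.1%})")
print(f"Total: {total:.2f} s")
```

The three phase times sum to the reported plot time of 2111.69 seconds, with Phase 1 and Phase 3 accounting for roughly 51% and 45% of it. Running the same script on a log from the faster build would show which phase regressed.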

Checking 100 random proofs with seed 0x817a865871628af6e33f1610f9321123395f749d857c6439cfdf3642d15a0c4c...
Plot compression level: 7
10%... 20%... 30%... 40%... 50%... 60%... 70%... 80%... 90%...
Seed used: 0x817a865871628af6e33f1610f9321123395f749d857c6439cfdf3642d15a0c4c
Proofs requested/fetched: 84 / 100 ( 84.000% )
Proof validation failed : 3 ( 3.000% )

WARNING: Deleting plot 'D:\plot-k32-c07-2023-10-01-01-34-0fd16652f305eb16ab8133f0cbdb939d9e62705f77dd32974785cc8e9b627f48.plot.tmp' as it is below the proof threshold: 0.840 / 0.950.
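The deletion decision in the warning above appears to come down to comparing the fetched-proof ratio with `--check-threshold`. The sketch below only illustrates that arithmetic. It is not bladebit's code: the function name `passes_check` is hypothetical, and the use of `>=` as the comparison is an assumption.

```python
def passes_check(proofs_fetched: int, proofs_requested: int, threshold: float) -> bool:
    """Return True if the fetched/requested ratio meets the threshold.

    Assumption: the plot is kept when ratio >= threshold. The real plotter
    may use a strict comparison or count failed validations differently.
    """
    ratio = proofs_fetched / proofs_requested
    return ratio >= threshold

# Values from the log: 84 of 100 proofs fetched, threshold 0.95.
kept = passes_check(84, 100, 0.95)
print(f"ratio = {84 / 100:.3f}, keep plot: {kept}")
```

With the values from this run the ratio is 0.840, which is below 0.950, so the plot is rejected.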

Completed writing plot in 135.99 seconds

chiabetatester commented 9 months ago

I'm seeing the same slow plot times: about twice as slow as the alpha 4.4 bladebit CUDA version. So far, every plot I have made with 3.1.0-rc2 has also been deleted, with a proof ratio of 0.2-0.6 that does not meet the 0.8 requirement.