Closed sobertram closed 9 months ago
Just happened again after entering this ticket. Just wanted to add it is not always around 5GB it just takes up enough so that the plotter cannot allocate enough memory. In the past it would sit steadily at ~500MB. This instance it took ~3GB and happened much faster than before. Will see if i can find anything else in the logs that give a clue to what triggers it.
Thu Dec 14 11:32:05 2023
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.104.05 Driver Version: 535.104.05 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+======================+======================|
| 0 Tesla P4 Off | 00000000:01:00.0 Off | 0 |
| N/A 82C P0 26W / 75W | 3836MiB / 7680MiB | 0% Default |
| | | N/A |
+-----------------------------------------+----------------------+----------------------+
+---------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=======================================================================================|
| 0 N/A N/A 839768 C chia_harvester 3832MiB |
+---------------------------------------------------------------------------------------+
Farming C20 plots takes that much VRAM... It only shoots up that high when it encounters a C20 plot and as you say you just started replotting with GH you probably only have a few so far.
https://github.com/madMAx43v3r/chia-gigahorse?tab=readme-ov-file#ram--vram-requirements-to-farm
You may want to reconsider C20 if you're only farming with a P4.
Farming C20 plots takes that much VRAM... It only shoots up that high when it encounters a C20 plot and as you say you just started replotting with GH you probably only have a few so far.
https://github.com/madMAx43v3r/chia-gigahorse?tab=readme-ov-file#ram--vram-requirements-to-farm
You may want to reconsider C20 if you're only farming with a P4.
Thanks so as long as i am not plotting in parallell it should be able to manage it. I will take this into consideration as I press forward.
As @duncandubick pointed out this seems consistent and makes sense. closing.
In case others run into this, as a fix i moved harvesting duties from the plotter to another harvester following instructions here, https://github.com/madMAx43v3r/chia-gigahorse?tab=readme-ov-file#remote-compute. Another fix could also be to add another GPU to your plotter.
Using latest:
This would have gone unnoticed but because i am also plotting on this machine. The plotter crashed with the following message:
Did nvidia-smi right after the crash and noticed the high mem use of the harvester: nvidia-smi
I restarted the harvester and that cleared it up. Typical memory use for harvesting is
520MiB
. Here is memory usage when both plotter and harvester are running:Running on
5.4.0-167-generic #184-Ubuntu SMP Tue Oct 31 09:21:49 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux
kernel.I am just switching my system over to gigahorse and got through 11 plots, this failed on phase2 of the 12th plot. System was running for about 2.5 to 3 hours before the issue.
Plotting info:
Last thing in log before the crash: