algorandfoundation / algorun

MIT License

Node crashes during sync - database or disk is full #2

Open Concentrate1234 opened 1 year ago

Concentrate1234 commented 1 year ago

Installation on Win11 + Docker + Python works well, but when I start the node, it crashes after a few minutes. After some investigation, it seems the /algod/data partition is full:

# df -h
Filesystem      Size  Used Avail Use% Mounted on
overlay        1007G  4.2G  952G   1% /
tmpfs            64M     0   64M   0% /dev
tmpfs           3.9G     0  3.9G   0% /sys/fs/cgroup
shm              64M     0   64M   0% /dev/shm
overlay         3.9G  3.9G     0 100% /algod/data
/dev/sde       1007G  4.2G  952G   1% /etc/hosts
tmpfs           3.9G     0  3.9G   0% /proc/acpi
tmpfs           3.9G     0  3.9G   0% /sys/firmware

Is it possible to change the docker container to have enough space to sync? Thanks.

Loedn commented 1 year ago

Hi @Concentrate1234, thanks for reporting this. Quick question: how much space do you have allocated for the container? In Docker Desktop, you can find this setting under Settings > Resources > Advanced.

Concentrate1234 commented 1 year ago

Hi, it seems the resource limit is managed by Windows. See the screenshot below:

Screenshot 2023-07-06 175823

Concentrate1234 commented 1 year ago

I uninstalled Docker Desktop and reinstalled it without WSL2. I set the resources as shown below and still get a full /algod/data partition after 5 to 10 minutes of running. It seems this partition is really too small (I have 250 GB on / but only 4.9 GB on /algod/data).

Screenshot 2023-07-06 190852

mimba commented 1 year ago

I have the same issue, using Python 3.11, Windows 11, and Docker with WSL. I have about 200 GB of free disk space, but algorun reports that the disk is full.

Concentrate1234 commented 1 year ago

OK, I found a workaround:

Now, in the Docker container, /algod/data is on the / partition, where there is enough space:

# df -h
Filesystem      Size  Used Avail Use% Mounted on
overlay        1007G  3.1G  953G   1% /
tmpfs            64M     0   64M   0% /dev
tmpfs           3.9G     0  3.9G   0% /sys/fs/cgroup
shm              64M     0   64M   0% /dev/shm
/dev/sde       1007G  3.1G  953G   1% /etc/hosts
overlay         3.9G  913M  3.0G  24% /etc/algorand/config.json
tmpfs           3.9G     0  3.9G   0% /proc/acpi
tmpfs           3.9G     0  3.9G   0% /sys/firmware

Concentrate1234 commented 1 year ago

Final solution for this issue:

  1. Stop the node with "algorun stop".
  2. Edit the docker-compose.yml file generated in the folder where the algorun command was run.
  3. Replace the lines
    • ${PWD}/data:/algod/data/:rw
    • ${PWD}/config.json:/etc/algorand/config.json:rw
     with:
    • ./data:/algod/data/:rw
    • ./config.json:/etc/algorand/config.json:rw
  4. Save the file.
  5. Run the "algorun start" command (in the same folder as the edited docker-compose.yml file).
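
For anyone who prefers not to edit the file by hand, step 3 above can be sketched as a small Python script. This is only an illustration, not part of algorun: the function names `relativize_pwd_mounts` and `patch_compose_file` are made up here, and the script assumes the generated docker-compose.yml contains the literal `${PWD}/` prefix exactly as quoted in the steps. It writes a `.bak` copy before changing anything.

```python
from pathlib import Path


def relativize_pwd_mounts(compose_text: str) -> str:
    """Return the compose text with every literal '${PWD}/' prefix replaced by './'.

    Assumption: the bind mounts look like '${PWD}/data:/algod/data/:rw',
    as quoted in the steps above.
    """
    return compose_text.replace("${PWD}/", "./")


def patch_compose_file(path: str = "docker-compose.yml") -> bool:
    """Rewrite the compose file in place; return True if anything changed.

    Hypothetical helper: the original file is saved next to it with a
    '.bak' suffix before the patched version is written.
    """
    compose_path = Path(path)
    original = compose_path.read_text(encoding="utf-8")
    patched = relativize_pwd_mounts(original)
    if patched == original:
        # No '${PWD}/' prefix found, so leave the file untouched.
        return False
    backup_path = compose_path.with_name(compose_path.name + ".bak")
    backup_path.write_text(original, encoding="utf-8")
    compose_path.write_text(patched, encoding="utf-8")
    return True
```

Run "algorun stop" first, call `patch_compose_file()` from the folder that contains docker-compose.yml, then run "algorun start" from that same folder. If the function returns False, the file did not contain the `${PWD}/` prefix and was not modified.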