shardeum / bug-reporting

59 stars 35 forks source link

Validator Stopped With Error ```Could not get seed list from seed node server, connect ENETUNREACH``` On TCP Port 4000 #55

Closed Shekelme closed 1 year ago

Shekelme commented 1 year ago
What is the issue?
My validator is stopping after a while with this error in dashboard:

Node exited with following message: Fatal: Could not get seed list from seed node server http://archiver-sphinx.shardeum.org:4000/nodelist: connect ENETUNREACH 3.127.57.166:4000

I have faced this problem twice already.

What impact does the issue have?

I can't run the validator 
How to reproduce the issue?
Simply start the validator in accordance with docs.shardeum.org, forwsrd the ports (9001 amd 10001) and wait for a while.
What other resources can you share regarding this issue?

image

MarcusWentz commented 1 year ago

It's possible the validator is not able to reach the archiver based on its current location.

Are you able to do a ping or trace route the archiver from that validator?

Shekelme commented 1 year ago

Yes. image

Shekelme commented 1 year ago

image

MarcusWentz commented 1 year ago

It looks like most packets are transmitted.

A few questions:

Q1. Is your validator on a local computer or remote server?

Q2. What is your current tested bandwidth

https://www.google.com/search?client=firefox-b-1-lm&q=google+test+speed

?

Q3. Are you having other network issues?

Shekelme commented 1 year ago

A1. Local PC A2.

Testing download speed................................................................................ Download: 305.36 Mbit/s Testing upload speed...................................................................................................... Upload: 40.97 Mbit/s

A3. I didn't notice anything significant, other applications are working fine.

MarcusWentz commented 1 year ago

@ANiX5202

https://github.com/Shardeum/shardeum-bug-reporting/issues/54#issuecomment-1432072371

Please answer the following questions as well:

Q1. Is your validator on a local computer or remote server?

Q2. What is your current tested bandwidth

https://www.google.com/search?client=firefox-b-1-lm&q=google+test+speed

?

Q3. Are you having other network issues?

ANiX5202 commented 1 year ago

@ANiX5202

#54 (comment)

Please answer the following questions as well:

Q1. Is your validator on a local computer or remote server?

Q2. What is your current tested bandwidth

https://www.google.com/search?client=firefox-b-1-lm&q=google+test+speed

?

Q3. Are you having other network issues?

A1. Remote server Ubuntu 20.04 A2. Node Name Upload Speed Download Speed Latency Speedtest.net 1989.05 Mbps 12111.80 Mbps 0.26 ms A3. I haven't

MarcusWentz commented 1 year ago

@Shekelme

Do you have logs when the validator goes from active to inactive in the CLI and GUI?

https://docs.shardeum.org/node/run/validator#step-6-monitor-validator

@ANiX5202

Were you able to run a validator in the past?

If so, was the remote server location the same when it was working before?

Shekelme commented 1 year ago

BTW my validator was working during Sphinx 1.0. Not sure how to watch the logs, but here is what it says in CLI:

node@8b031bec162e:~/app$ operator-cli status state: stopped exitMessage: 'Fatal: Node cannot join with invalid external IP: 0.0.0.0' exitStatus: Exit with error performance: memPercentage: 56.53958321089468 diskPercentage: 33.34332108101633 cpuPercentage: 2.3213194868662135 stakeRequirement: '10.0' lockedStake: '14.0' nominatorAddress: '0xf9055a98e7947d4000fe9a2ae30ae3e11b0bf58a'

And the same message is in GUI: image

chanderpalrathore commented 1 year ago

image Getting same error for me

mihauman commented 1 year ago

image

I have the same error

Jensnicolaigustavsen commented 1 year ago

alright my node is up and running!

Jensnicolaigustavsen commented 1 year ago

let me know if you have any problems

mihauman commented 1 year ago

let me know if you have any problems

ok thx

Shekelme commented 1 year ago

@Jensnicolaigustavsen What did you do to make your node work?

MarcusWentz commented 1 year ago

@Shekelme

Is port 4000 open on your system?

I heard someone opened this port and the network timeouts were fixed.

Shekelme commented 1 year ago

I have opened it today. Should I restart Shardeum validator somehow? Because I see it again: Node exited with following message: Fatal: Could not get seed list from seed node server http://archiver-sphinx.shardeum.org:4000/nodelist: connect ENETUNREACH 3.127.57.166:4000

MarcusWentz commented 1 year ago

Try this:

Go into the operator dashboard docker (may be different if you customized install location:

cd ~/.shardeum
./shell.sh

Get your node's external IP:

curl https://ipinfo.io/ip

The returned IP in the format of nnn.nnn.nnn.nnn is your EXTERNAL_IP.

Set the number above in place of EXTERNAL_IP:

export APP_IP="EXTERNAL_IP"

https://docs.shardeum.org/node/run/validator#step-9-stake-shm-to-validator

Shekelme commented 1 year ago

Done already. No positive effect...

mihauman commented 1 year ago

Shekelme You restarted the node after opening the port?

mizovsky2304 commented 1 year ago

Capture d’écran 2023-02-16 à 17 00 19

I have the same issue...

Shekelme commented 1 year ago

@mihauman Restarted the node via web GUI dashboard, and aftere a while I restarted the docker container. But again: image I even opened the port 3001....

mihauman commented 1 year ago

Shekelme open port 4000 and 4001 and restart node

Shekelme commented 1 year ago

@mihauman No effect, still the same error. But this time, it seems, the node worked for several hours... image

MarcusWentz commented 1 year ago

You can open port 4000 in Linux with:

sudo ufw allow 4000

https://www.digitalocean.com/community/tutorials/opening-a-port-on-linux

Shekelme commented 1 year ago

I can't, since my ufw is always inactive.

tbros6868 commented 1 year ago

How to open ports 4000 and 4001 and then restart the validator:

sudo apt install ufw
sudo ufw allow 4000
sudo ufw allow 4001
ufw allow ssh
ufw allow 8080
ufw allow https
ufw allow http
ufw allow 443
ufw enable
sudo ufw reload
sudo ufw status
Shekelme commented 1 year ago

I like this project so much. OK, will try to run it on separate PC.

ShriyanshShukla commented 1 year ago

Not working, getting the same error

image

MarcusWentz commented 1 year ago

@Shekelme

We are looking into the archive node to see what might be causing this.

mizovsky2304 commented 1 year ago

Same issue here... I think the devs had a lot of work the last three days... 1- public key address 2- seed list: port 4000 problem 3- today: _getSeedNodes Capture d’écran 2023-02-17 à 16 48 28

xkemox commented 1 year ago

IMG_20230218_011551 All ports are open still got the same error.. Anyone manage to fix it?

Jensnicolaigustavsen commented 1 year ago

im also getting this error: image

Jongjh commented 1 year ago

Im facing the same issue - Node exited with following message: Fatal: Could not get seed list from seed node server http://archiver-sphinx.shardeum.org:4000/nodelist: Timeout awaiting 'request' for 5000ms

image

anaderiasoft commented 1 year ago

Im facing the same issue - Node exited with following message: Fatal: Could not get seed list from seed node server http://archiver-sphinx.shardeum.org:4000/nodelist: Timeout awaiting 'request' for 5000ms

image

yes same "ufw allow 4000" doesnt give any result

ostd6212 commented 1 year ago

Similar issue Screenshot 2023-02-18 at 14 20 42 copy

zodiac55 commented 1 year ago

Node exited with following message: Cannot read properties of undefined (reading 'id')

What can be a problem?

mihauman commented 1 year ago

Who had a port 4000 error in shardeum

  1. sudo nano /etc/ufw/before.rules add next row before COMMIT row
  2. -A ufw-before-input -p tcp --dport 4000 -j ACCEPT
  3. Save the files and close the file.
  4. sudo ufw reload
Shekelme commented 1 year ago

@mihauman But what if ufw is simply disabled?

ostd6212 commented 1 year ago

![Uploading Screenshot 2023-02-20 at 11.28.59.png…]()

mihauman commented 1 year ago

Shekelme 1)sudo ufw status 2)sudo ufw enable 3)sudo nano /etc/ufw/before.rules 4)add next row before COMMIT row: -A ufw-before-input -p tcp --dport 4000 -j ACCEPT 5)-A ufw-before-input -p tcp --dport 4000 -j ACCEPT 6)sudo ufw reload

MarcusWentz commented 1 year ago

It sounds like we have a potential bug fix for this with the latest validator update.

Update your validator with:

curl -O https://gitlab.com/shardeum/validator/dashboard/-/raw/main/installer.sh && chmod +x installer.sh && ./installer.sh

https://docs.shardeum.org/node/run/validator#step-2-download-and-install-validator

Does this error still occur after the latest validator update?

Shekelme commented 1 year ago

@MarcusWentz Unfortunately yes, still the same error: image

ehenry-50 commented 1 year ago

I am having the same error-Node exited with following message: Fatal: Could not get seed list from seed node server http://archiver-sphinx.shardeum.org:4000/nodelist: Timeout awaiting 'request' for 5000ms

scorpion141 commented 1 year ago

I have the Same Error.

grafik

Ports are open:

grafik

Shekelme commented 1 year ago

@scorpion141 It seems that you forgot 9001 and 10001.

scorpion141 commented 1 year ago

@Shekelme thank you very mutch.

i opend the Porta now but i get a Link in the Discord to test the IP and the Port and it says closed to me. Maybe its a Problem with the VPS.

https://www.yougetsignal.com/tools/open-ports/

scorpion141 commented 1 year ago

Ok i think that is not correct the Page said Port 8080 is closed but i can connect on the Webpage.

scorpion141 commented 1 year ago

@Shekelme got the Same Error with the New Ports opend.

Iam also Try this without a Firewall and i ask the Service Provider if they Blocked Ports but no.

MarcusWentz commented 1 year ago

Please update your validator to version 1.1.1.

Here's what you need to do:

  1. Stop and unstake your validator.
  2. If you're within the Validator CLI:
exit

then:

cd ..
  1. Enter:
curl -O https://gitlab.com/shardeum/validator/dashboard/-/raw/main/installer.sh && chmod +x installer.sh && ./installer.sh
  1. Start the validator.
  2. Stake 10 SHM.