openethereum / parity-ethereum

The fast, light, and robust client for Ethereum-like networks.
Other
6.81k stars 1.68k forks source link

parity often stops syncing and needs to be restarted #11854

Open TomWang10 opened 4 years ago

TomWang10 commented 4 years ago

OpenEthereum version: OpenEthereum/v3.0.1-stable-8ca8089-20200601/x86_64-unknown-linux-gnu/rustc1.43.1 Operating system: Linux Installation: Binary executable Fully synchronized: yes Network: ethereum Restarted: yes

parity often stops syncing and needs to be restarted

rakita commented 4 years ago

Hm, not enough information to deduce anything about problem.

gsalzer commented 4 years ago

Same here, over and over again. Every other day.

gsalzer commented 4 years ago

And when sending openethereum the signal to stop, it acknowledges it in the log file ("Finishing work, please wait..."), but it won't stop. After killing the process it restarts without problem.

gsalzer commented 4 years ago

Hm, not enough information to deduce anything about problem.

How can we provide enough information? Any particular debug flags to activate?

rakita commented 4 years ago

Hello @gsalzer, you can start by including logs and giving us more information on your setup. And saying that the app just stops syncing is very broad sentence.

@TomWang10 When you say "often stops" it does not help, because often is very relative word, does it means once per hour, day or week.

gsalzer commented 4 years ago

@rakita My setup is identical to TomWang10's in the original posting. See the toml below.

"And saying that the app just stops syncing is very broad sentence."

There is not much more to say than i already said above: openethereum just hangs without consuming much resources except the memory it sits on, doesn't write any log messages, and doesn't react properly to TERM signals. In the log below you see two such incidents:

openethereum.log

config.txt

rakita commented 4 years ago

@gsalzer yep this seems like #11758 issue. Information that there is no logs (even periodic one) for hours fits with issue at hand.

You can read more on topic there

liamaharon commented 4 years ago

Same here. OpenEthereum is ~useless~ not viable for certain production use cases until this is fixed.

cogmon commented 4 years ago

Same here. OpenEthereum is useless until this is fixed.

Totally, same here. OpenEthereum is broken for node operators.

adria0 commented 4 years ago

@TomWang10 @gsalzer @liamaharon @cogmon

The current OE team released the 3.0 from the 2.7 branch without being aware that 2.7 was not production-ready. We have been working on them for months without success, and the Berlin Hard fork is every day closer.

After seeing that there a lot of problems with the DB version upgrade and super-hard-to-catch async errors we finally decided to backport starting from the 2.5.13 branch that is the more stable version known at the moment. We expect to release the 3.1, based on 2.5.13 by mid September.

cogmon commented 4 years ago

@adria0

In the release notes of v2.7.2, it doesn't say "not production-ready". The "stable" in the release name suggests differently.

Do I understand correctly that anyone who updated to 2.7.2 is now screwed and has to resync - which for certain node configurations is a matter of several months?

The problem seems to be the production release of 2.7 in the first place, not that OE builds on top of 3.0.

roninkaizen commented 4 years ago

due to my absolutely personal experiences, the limitation with cache-size 4096 changed the behaviour of my nodes back to stability, next would have been the reducing peer-amount down to 15, to declare: i do not run them "archive-mode" nor persistant tx-que but json-rpc activated- main-net

grafik and yes, those above mentioned versions were all compiled&tested :) to give an idea of persistance in watching

mmwanga commented 4 years ago

This is happening to me too on version 3.0.1. The absence of logs for a period of time is sufficient evidence of this issue.

./openEthereum/openethereum -d /data/parity/ --jsonrpc-interface all --jsonrpc-apis all --jsonrpc-cors "*" --geth --tx-queue-mem-limit=256 --tx-queue-size=8192 --cache-size=4096

mmwanga commented 3 years ago

@TomWang10 @gsalzer @liamaharon @cogmon

The current OE team released the 3.0 from the 2.7 branch without being aware that 2.7 was not production-ready. We have been working on them for months without success, and the Berlin Hard fork is every day closer.

After seeing that there a lot of problems with the DB version upgrade and super-hard-to-catch async errors we finally decided to backport starting from the 2.5.13 branch that is the more stable version known at the moment. We expect to release the 3.1, based on 2.5.13 by mid September.

Any updates on this? Eagerly waiting

adria0 commented 3 years ago

@TomWang10 @gsalzer @liamaharon @cogmon The current OE team released the 3.0 from the 2.7 branch without being aware that 2.7 was not production-ready. We have been working on them for months without success, and the Berlin Hard fork is every day closer. After seeing that there a lot of problems with the DB version upgrade and super-hard-to-catch async errors we finally decided to backport starting from the 2.5.13 branch that is the more stable version known at the moment. We expect to release the 3.1, based on 2.5.13 by mid September.

Any updates on this? Eagerly waiting

Please, read https://github.com/openethereum/openethereum/issues/11858