Open romanzac opened 2 weeks ago
the second log you shared seems to show the node has finished syncing - does the /health endpoint still not return true?
the fourth log shows the node has not finished syncing
I see in your logs that we use `debug/v1/info` in the test - which results in
Response status code: 200. Response content: b'{"listenAddresses":["/ip4/172.18.205.233/tcp/6886/p2p/16Uiu2HAmGNtM2rQ8abySFNhqPDFY4cmfAEpfo9Z9fD3NekoFR2ip","/ip4/172.18.205.233/tcp/6887/ws/p2p/16Uiu2HAmGNtM2rQ8abySFNhqPDFY4cmfAEpfo9Z9fD3NekoFR2ip"],"enrUri":"enr:-LO4QGGlww8liwBmcHFHdLcXwt-Uq0c6iU6cdDJ6pWlh2avnWILMdWa9P_iCS0kiWhLuECjRTMvxoykPXyP5sKjcx88BgmlkgnY0gmlwhKwSzemKbXVsdGlhZGRyc4wACgSsEs3pBhrn3QOCcnOFAAABAACJc2VjcDI1NmsxoQM3Tqpf5eFn4Jztm4gB0Y0JVSJyxyZsW8QR-QU5DZb-PYN0Y3CCGuaDdWRwghrohXdha3UyAQ"}'
INFO src.node.waku_node:waku_node.py:193 REST service is ready !!
does node1 not respond after this is available?
testing with https://github.com/waku-org/nwaku/pull/2664 on my own node - seems to be the health endpoint returning `not ready` and `healthy` in a flaky manner. cc: @NagyZoltanPeter
It's a known issue, already solved in https://github.com/waku-org/nwaku/pull/2612 and it will be part of the next release!
awesome, i might be missing something, but does the pr address the node `healthy` and `not ready` flakiness?
suggest to use this image - quay.io/wakuorg/nwaku-pr:2664-rln-v1
Indeed! It separates the node's general readiness after initialization from the status of rln_relay. There is now an array of protocol statuses (to be improved in the future) in which you can see the rln_relay status; that status can still switch from ready to synchronizing and back, following the actual state. For example:
    {
      "nodeHealth": "Ready",
      "protocolsHealth": [
        {
          "Rln Relay": "Ready"
        }
      ]
    }
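For the interop tests, a response of this shape could be checked roughly as follows. This is only a sketch: the helper names `parse_health` and `rln_relay_ready` are hypothetical, and the field names (`nodeHealth`, `protocolsHealth`, `Rln Relay`) are taken solely from the sample response in this thread, so they may change once the protocol status array is improved.

```python
import json


def parse_health(body: str):
    """Split a /health JSON body into (node_ready, protocol_statuses).

    Assumes the shape shown in this thread: a "nodeHealth" string and a
    "protocolsHealth" list of single-entry objects.
    """
    payload = json.loads(body)
    node_ready = payload.get("nodeHealth") == "Ready"
    statuses = {}
    for entry in payload.get("protocolsHealth", []):
        # Each entry maps one protocol name to its status string.
        statuses.update(entry)
    return node_ready, statuses


def rln_relay_ready(body: str) -> bool:
    """True only if both the node and the RLN relay report "Ready"."""
    node_ready, statuses = parse_health(body)
    return node_ready and statuses.get("Rln Relay") == "Ready"
```

Keeping the node status and the per-protocol status separate lets a test tell "node is up" apart from "RLN relay has finished syncing".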
wow, nice 🔥
Good progress on health info indeed! I'll wait for the next release to enable onchain tests. And I will try to play with /health endpoint on Monday. Thanks for now.
@rymnc: Hi, about the original issue: from the logs I don't see the actual problem. The first log tells me that the node started well; it took almost 7 min to sync the on-chain RLN blocks. The second node (or log) ends while still syncing, but only 3 min after startup. Maybe the start sequence needs to be checked: it seems to me that the nodes were started with a delay between them, while the timeout (the 10 min one, I think) is counted from the first node's start. Can you please check this scenario? Maybe you need to extend the timeout to give the second node enough time to get in sync.
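To illustrate the timeout point, here is a minimal Python sketch of a readiness wait whose deadline is counted from the moment the wait for a given node begins, rather than from the start of the first node. The function `wait_until_ready` and its parameters are hypothetical and not taken from the interop test code; `check` stands for whatever readiness probe the tests use (for example a call to the health endpoint).

```python
import time
from typing import Callable


def wait_until_ready(
    check: Callable[[], bool],
    timeout_s: float,
    interval_s: float = 5.0,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll check() until it returns True or timeout_s has elapsed.

    The deadline is computed when this function is called, so calling it
    once per node, right after that node is started, gives every node its
    own full timeout budget.
    """
    deadline = clock() + timeout_s
    while True:
        if check():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

With a sync time of almost 7 minutes per node, calling this separately after each node start with a 10 minute budget would leave headroom for both nodes, whereas a single shared 10 minute window would not if the second node starts several minutes late.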
Adding logs here from yesterday's testing of PR 2664. I have also started to work on integrating the node health check into the interop tests: https://github.com/waku-org/waku-interop-tests/pull/35 . Hopefully we can have the next nwaku release and matching interop tests for RLN matured at the same time. Say no to QA lag! :)
Problem
During execution of "test_publish_with_valid_payloads_dynamic_at_slow_rate", the first container, running Node1, does not start even within the 10-minute timeout. Static mode works fine. Please have a look.
I've built the Docker image from release wakunode2-v0.27.0.
Impact
High occurrence, medium severity. RLN in dynamic mode is not functional.
To reproduce
Expected behavior
RLN Relay works in on-chain mode.
Screenshots/logs
node1_2024-05-03_12-45-385a5531e5-c304-462b-a7f3-f58ba92d0a0bharbor.status.im_wakuorg_nwaku:latest.log
node1_2024-05-03_12-45-38c9737de4-6df4-4f4c-b308-323968164308harbor.status.im_wakuorg_nwaku:latest.log
node1_2024-05-03_12-46-0196df9f86-46da-4444-8b3a-7b6d0e7e060aharbor.status.im_wakuorg_nwaku:latest.log
node2_2024-05-03_12-45-38c9737de4-6df4-4f4c-b308-323968164308harbor.status.im_wakuorg_nwaku:latest.log
test_run.log