Closed okdas closed 5 months ago
CCing @ezeike @adshmh as well. This is the first (of many) tickets where protocol & backend work will start to overlap.
Go routines leaks has been significantly reduced by #410. Providing results to compare against the numbers resulting from the load test need to be done on the same hardware.
@okdas , could you please run load tests again on PR #410's branch with the same setup so we can compare numbers?
@red-0ne
could you please run load tests again on PR https://github.com/pokt-network/poktroll/pull/410's branch with the same setup so we can compare numbers?
Do we not have a way to reproduce this locally?
I recall reviewing & updating the instructions here: https://dev.poktroll.com/infrastructure/testing/load_testing
The leaks count is request dependent, and reference numbers would not be the same as the load testing script is 100VUs x 1minute with as much requests as possible. So numbers may differ from one machine to the other.
The leaks count is request dependent, and reference numbers would not be the same as the load testing script is 100VUs x 1minute with as much requests as possible. So numbers may differ from one machine to the other.
Got it.
My goal is not to have a specific number but understand how to reproduce it.
For example:
The leaks count is request dependent, and reference numbers would not be the same as the load testing script is 100VUs x 1minute with as much requests as possible. So numbers may differ from one machine to the other.
Got it.
My goal is not to have a specific number but understand how to reproduce it.
For example:
- If I wanted to check/observe/see if there are any leaks on my LocalNet, which doc do I read/follow to do this?
- If I wanted to check/observe/see if there are any leaks on my DevNet, which doc do I read/follow to do this?
curl localhost:PORT | grep go_goroutines
(9003 for appgate server, 9004 for relay miner)make send_relay
2.
again and see the difference for a single relay.I will document this in docusaurus. Will create a ticket to capture it.
@red-0ne posts an update next friday on how many go routine leaks we have. The goal is zero. If there is a VERY clear path to getting to zero, keep this open. If there's isn't, close it out and new tickets will be created when necessary.
@okdas Do you think we have resolved enough go-routines to close this one out?
@Olshansk looking at resource utilization on DevNets - we still saturate CPU when not serving requests. I'll get some pprof snapshots from these nodes so we can investigate.
Perfect, thanks for the update @okdas!
I think we're in a good position now with one exception we are going to work on - #551.
Objective
Both AppGate and RelayMiner creating, but not cleaning up goroutines at approx 2 goroutines per relay.
Origin Document
Goals
How to reproduce
make localnet_up
make supplier1_stake && make app1_stake
&&make test_e2e
AppGate:
RelayMiner:
(This number already seems kind of high after runnung e2e tests)
make load_test_simple
AppGate:
RelayMiner:
General deliverables
Creator: @okdas Co-Owners: @red-0ne @okdas