Open andrejpodzimek opened 2 months ago
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 120 days.
This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 120 days.
@andrejpodzimek I've been working on something that may alleviate your problem. It was done for relays serving hundreds of clients but perhaps it could work here too.
https://github.com/IntersectMBO/cardano-node/tree/karknu/thread_isolation , based on 10.1.2 so will require a chain replay if you're still on 9.2.1. Experimental so best to test it on your backup BP or on a testnet.
Internal/External External
Area Other
Summary Leader log queries impede critical validator processing and cause extreme numbers of missed slot leader checks.
Steps to reproduce
cardano-cli
query in a loop against the validator (example below).Expected behavior Proper resource isolation.
System info (please complete the following information):
cardano-node --version
):cardano-cli --version
):Screenshots and attachments An example query to expose resource isolation problems:
RTS options:
Additional context This case could be dismissed with “use a workaround”, i.e. “have a separate relay node for slot leader queries only”, i.e. not for routing to a validator. However, such an idea is suboptimal, increasing the amount of resources a pool operator must set aside by up to 50%, compared to the simplest relay + validator setup.
The lack of proper resource isolation may have been a contributing factor to my problem of never successfully validating a block, described in this post and above.