Open Bill-Kunj opened 1 year ago
Andrew and Will to discuss
@azazime to lead this one and ask Will (or others) for help if he needs it.
@azazime Where are we on this?
I'm in contact with @zsluedem on this.
I created a bash script [filename: tools] that checks the last block number; you can find it in the rhoast client directory. I opened the pull request directly here on GitHub because, for some reason, I couldn't push it from my shell. https://github.com/rchain-community/rhoast-client/tree/azazime-patch-4
When determining the reason for an outage, it is important to know the last block number of every validator in the network. The command
curl https://node25.root-shard.mainnet.rchain.coop/api/blocks|jq '.[0].blockNumber'
will show the last block number of that node. We need a tool that automates this across the network, based on the size of the shard and/or the observers. This data can then be fed into Prometheus for monitoring and alerting by Grafana.
rhoast-tools can be used for this. See @Valentine for details.
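As a rough illustration of the automation described above, here is a minimal bash sketch that runs the same `curl | jq` query against a list of nodes and prints one line per node in the Prometheus text format. This is not the `tools` script from the pull request. The function names, the metric name `rchain_last_block_number`, and the `-1` value for unreachable nodes are all assumptions made for the example. The list of hostnames has to be supplied by the caller.

```shell
#!/usr/bin/env bash
# Sketch only: names below are hypothetical, not taken from rhoast-client.

# Query one node's /api/blocks endpoint and print its newest block number.
last_block_number() {
  local host="$1"
  curl --silent --fail --max-time 10 "https://${host}/api/blocks" \
    | jq -r '.[0].blockNumber'
}

# Print one Prometheus-style metric line per hostname given as an argument.
# Nodes that do not answer with a numeric block number are reported as -1
# (an assumed convention for this sketch).
report_block_numbers() {
  local host number
  for host in "$@"; do
    number="$(last_block_number "$host")" || number=""
    if [[ "$number" =~ ^[0-9]+$ ]]; then
      printf 'rchain_last_block_number{node="%s"} %s\n' "$host" "$number"
    else
      printf 'rchain_last_block_number{node="%s"} -1\n' "$host"
    fi
  done
}
```

After sourcing the file, `report_block_numbers node25.root-shard.mainnet.rchain.coop` would print a single line for that node; passing every validator and observer hostname gives one line each. The output could be written to a file on a schedule and picked up by whatever Prometheus collection setup is in use, which this sketch leaves open.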