rchain-community / mainnet-outage

0 stars 0 forks source link

last block number tool #19

Open Bill-Kunj opened 1 year ago

Bill-Kunj commented 1 year ago

When determining the reason for an outage, it is important to know the last block number of every validator in the network. The command curl https://node25.root-shard.mainnet.rchain.coop/api/blocks|jq '.[0].blockNumber' will show the last block number, and we need a tool that automates this based on the size of the shard and/or observers. This data can then be fed into prometheus for monitoring and alerting by grafana

rhoast-tools can be used for this. See @Valentine for details

DPMBarnes commented 1 year ago

Andrew and Will to discuss

DPMBarnes commented 1 year ago

@azazime to lead this one and ask Will (or others) for help if he needs it.

Bill-Kunj commented 1 year ago

@azazime Where are we on this?

azazime commented 1 year ago

i'm in contact with @zsluedem on this

azazime commented 1 year ago

i created a bash script [filename: tools] that helps check the last block number, you can check the rhoast client directory to see this. i did a pull request remotely here on GitHub because for some reasons i couldn't push it directly from my shell. https://github.com/rchain-community/rhoast-client/tree/azazime-patch-4