Closed jessicadaugherty closed 1 year ago
Questions
@Gustavobelfort Making a "public note to self" for us to sync on this offline.
Refactored Deliverables
@Gustavobelfort @Olshansk
Currently the state of the health module is as follows:
StateSize
Block Size
API should be able to allow either paginating or opting in/out of the data contained in the blocks (claims, proofs , transactions).
Not clear wether the returned information is "partial" or not (ie: historic vs live query)
Lack of clarity on null returns
Total relays
Total transactions
Session Metrics
Lifecycle Metrics
After discussing with @iajrz we don't think that the module is ready to be merged in the codebase, some unwanted side effects in pocket-core
might pop up if we decide to do so.
Ideally we should review the metric requirements of the health module in order to better design what should be returned, utilize the bits and pieces of the code that are working and either trim out the pieces that do not work or fix them basing ourselves on the design doc created beforehand, only then decide about merging.
Closing this out as the work is no longer relevant. We can reference this PR if we ever choose to pick it up again.
Objective
Following the 0.9.1.1 Chain Halt Post-Mortem, there are action items re: monitoring mainnet (and ideally testnet) that we need to get better coverage of to increase our chances of catching bugs/errors before releases are in production, as well as helping us triage issues in the event of a release/production crisis.
Due to the need to process state dumps and aggregate by network actors, we need a dedicated exporter available rather than making enhancements to an existing exporter.
Origin Document
A health module was designed and built but remained inactive that includes:
Consensus, State and Transaction metrics are the most relevant when triaging crises like a chain halt, while data size and lifecycle metrics help us observe the performance of state size and transitions through the state.
Goals
Deliverable
Non-goals / Non-deliverables
General issue deliverables
Testing Methodology
Creator: @jessicadaugherty