edgi-govdata-archiving / web-monitoring

Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")
Creative Commons Attribution Share Alike 4.0 International
105 stars 17 forks source link

Add sanity check that we are regularly adding new versions #80

Closed Mr0grog closed 4 years ago

Mr0grog commented 7 years ago

While we can try and write our importing tools to throw and notify about errors as much as possible, there can always be silent failures we miss, e.g. https://github.com/edgi-govdata-archiving/web-monitoring-versionista-scraper/issues/51

It might be a useful sanity check to regularly make sure new versions are being added to the DB. Some approaches:

I’m thinking (1) is probably the best bet here since it is least hobbled by anything that could happen in an import script and because it can happen in one place, but flagging this here rather than on the DB project in case there are other opinions.

Any thoughts, @danielballan?

danielballan commented 6 years ago

The single best human-readable metric I can think of is a histogram of Versions per Page added in a given day.

stale[bot] commented 5 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.

Mr0grog commented 5 years ago

Still relevant, and something we should do alongside @danielballan’s upcoming metrics plans!

stale[bot] commented 5 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.

stale[bot] commented 4 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.