Closed Mr0grog closed 4 years ago
The single best human-readable metric I can think of is a histogram of Versions per Page added in a given day.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.
Still relevant, and something we should do alongside @danielballan’s upcoming metrics plans!
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in seven days if no further activity occurs. If it should not be closed, please comment! Thank you for your contributions.
While we can try and write our importing tools to throw and notify about errors as much as possible, there can always be silent failures we miss, e.g. https://github.com/edgi-govdata-archiving/web-monitoring-versionista-scraper/issues/51
It might be a useful sanity check to regularly make sure new versions are being added to the DB. Some approaches:
I’m thinking (1) is probably the best bet here since it is least hobbled by anything that could happen in an import script and because it can happen in one place, but flagging this here rather than on the DB project in case there are other opinions.
Any thoughts, @danielballan?