edgi-govdata-archiving / web-monitoring-task-sheets

Experimental new tool for generating weekly analyst task sheets for web monitoring
GNU General Public License v3.0

Analysis fails for pages with no `status` #12

Closed: Mr0grog closed this issue 3 years ago

Mr0grog commented 3 years ago

It’s possible for a page to have no status even if it has versions. However, we currently assume that pages with versions will always have a status: https://github.com/edgi-govdata-archiving/web-monitoring-task-sheets/blob/e593aadad9cd93e368d22e96369d8230ea6c1e80/analyst_sheets/analyze.py#L289-L290

If the page has no status, we should probably just use the status code of the latest version.
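A minimal sketch of that fallback, not the actual fix: it assumes a page is a dict with an optional `status` and a `versions` list, and that each version carries a `status` and an ISO 8601 `capture_time` string in a consistent format. The helper name `effective_status` is hypothetical and is not taken from `analyze.py`.

```python
from typing import Optional


def effective_status(page: dict) -> Optional[int]:
    """Return the page's status, falling back to its latest version's status.

    Hypothetical helper: the field names (`status`, `versions`,
    `capture_time`) are assumptions about the page data, not confirmed
    against analyze.py.
    """
    status = page.get('status')
    if status is not None:
        return status

    versions = page.get('versions') or []
    if not versions:
        # No page status and no versions: nothing to fall back on.
        return None

    # Assumes capture_time values are ISO 8601 strings in one consistent
    # format, so string comparison matches chronological order.
    latest = max(versions, key=lambda version: version['capture_time'])
    return latest.get('status')
```

With something like this, the analysis could call `effective_status(page)` wherever it currently reads the page's status directly, and handle a `None` result explicitly instead of failing.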

(See also the web-monitoring-db issue about making this situation rarer: https://github.com/edgi-govdata-archiving/web-monitoring-db/issues/860)