-
The `Version` model has an `after_create` callback that updates its page’s `title` field: https://github.com/edgi-govdata-archiving/web-monitoring-db/blob/213dc89341a59d975eae35e438a0c7953820b61b/app/…
-
Ex:
- https://github.com/edgi-govdata-archiving/wayback/blob/024a80ec0eae44fcf64e3d834d5f43c78937b8fb/wayback/_models.py#L253
- https://github.com/edgi-govdata-archiving/wayback/blob/024a80ec0eae44f…
-
https://data.europa.eu/data/datasets?locale=en&minScoring=0&query=&page=1&catalog=govdata
![image](https://user-images.githubusercontent.com/5406352/184647523-f6aff5f6-810c-4916-8510-55838329df3d.p…
-
`WaybackClient.get_memento` has left-over rate-limiting behavior from web-monitoring-processing: https://github.com/edgi-govdata-archiving/wayback/blob/f1cdb1dc2692b4215f00a2c05c53fe9c733b24f9/wayback…
-
It looks like something has changed about either Requests or the Wayback Machine, and we are no longer including all the original archived headers in a Memento object’s `headers` property. For example…
-
https://www.govdata.de/
![image](https://user-images.githubusercontent.com/5406352/184641096-a0c14def-ae8f-4fba-b667-55afa468fe9d.png)
-
https://www.govdata.de/web/guest/daten
![image](https://user-images.githubusercontent.com/5406352/184641390-9cb90a0d-69a7-49e5-857e-fc775ed21e26.png)
-
Some sites have error pages that don’t respond with actual error status codes (i.e. they have a `200` status code instead of `>= 400`). For example: https://www.eia.gov/tools/models/datatools.cfm
S…
-
Last week, I made a bunch of changes to our database’s configuration (edgi-govdata-archiving/web-monitoring-ops#26) and to the indexes for the versions table (#548) to address massive performance issu…
-
Dataset metadata records generated from the geoportal must, in addition to the harvest to GDI-DE and INSPIRE, also be available for GovData (OpenData). Here it seems that an older licence code is stil…