infinite-dao / glean-cetaf-rdfs

Collect and glean RDF data in parallel of stable identifiers of the Consortium of European Taxonomic Facilities (CETAF) and prepare them for import into a SPARQL endpoint
GNU General Public License v3.0
0 stars 0 forks source link

Import Errors — Meise (botanicalcollections.be) #6

Open infinite-dao opened 2 years ago

infinite-dao commented 2 years ago

Counts of error summary (details see: Thread-XX_Meise_2022-May_all_error.log)

Counts Domain Pattern Error Summary
2 https://www.botanicalcollections.be/specimen/CETAF-ID... Codes: ERROR: 404 Not Found;
169 https://www.botanicalcollections.be/specimen/CETAF-ID... Codes: ERROR: 500 Internal Server Error;
121 https://www.botanicalcollections.be/specimen/CETAF-ID... Codes: OK: 303 See Other;ERROR: 500 Internal Server Error;
8 https://www.botanicalcollections.be/specimen/CETAF-ID... Codes: OK: 303 See Other;ERROR: No data received.;OK: 200 OK;
1 http://www.botanicalcollections.be/specimen/CETAF-ID... Codes: OK: 302 Found;ERROR: 500 Internal Server Error;
2 http://www.botanicalcollections.be/specimen/CETAF-ID... Codes: OK: 302 Found;OK: 303 See Other;ERROR: 500 Internal Server Error;
infinite-dao commented 1 year ago

I collected all URIs we have gathered so far, the URIs with 500 Internal Server Error can be gathered again with no problems.

Presently 138 URIs remain unclear with 404 Not Found, we gathered them earlier (one or two years before) but now they are not reported any more on GBIF:

Counts Domain Pattern Error Summary
138 https://www.botanicalcollections.be/specimen/CETAF-ID... Codes: ERROR: 404 Not Found;
infinite-dao commented 1 year ago

Missing Identifiers ~ Lost in space …

An interesting question raises here: what to do with stable identifiers that became missing — because normally this should not happen with stable persisting identifiers :grin: …

Possible proper solutions: