bio-guoda / preston

a biodiversity dataset tracker
MIT License
25 stars 1 forks source link

introduce no-access badge to indicate known DwC-A registration but non accessible content #273

Closed jhpoelen closed 6 months ago

jhpoelen commented 6 months ago

inspired by https://discourse.gbif.org/t/assessing-fairness-of-biodiversity-data-through-badges-and-download-buttons/4246/6 , in which a dataset was registered with GBIF, but their DwC-A endpoint URL was not accessible to retrieve the content.

Previously, "FAIR" and "unknown" badges were produced.

Suggested is to produce "FAIR", "no access" and "unknown" DwC-A badges.

"FAIR" - content associated with provided identifier is findable (via GBIF/iDigBio), accessible (via registered endpoint), interoperable (by using DwC-A) and re-usable (can be moved and redirected to).

"no access" - content associated with provided identifier is findable (via GBIF/iDigBio), but could not be accessed via the registered endpoint.

"unknown" - provided identifier has unknown associations with desired content (e.g., DwC-A) in the current provenance graph.

jhpoelen commented 6 months ago

a first version of a no-access badge was introduced via v0.7.16

for example, see attached

image

Also note that the http response header contains provenance information on the requested resource, even if the associated content would not be accessed.

curl -I https://linker.bio/badge/10.15468/mwjnku

yielded -

HTTP/1.1 404 Not Found
Server: nginx/1.18.0 (Ubuntu)
Date: Wed, 10 Jan 2024 19:18:56 GMT
Content-Type: image/svg+xml
Content-Length: 1301
Connection: keep-alive
Content-Location: https://registry.nbnatlas.org/archives/dr940/dr940.zip
X-UUID: urn:uuid:926f5a1c-8995-498a-913b-fe0312e1071f
X-DOI: https://doi.org/10.15468/mwjnku
X-PROV-wasInfluencedBy: urn:uuid:926f5a1c-8995-498a-913b-fe0312e1071f https://doi.org/10.15468/mwjnku
X-PROV-wasGeneratedBy: urn:uuid:e793c22e-7ffa-4034-835b-f10b27b2efc1
X-PROV-generatedAtTime: 2023-12-03T09:46:45.012Z
X-DC-format: application/dwca
X-PROV-hadPrimarySource: hash://sha256/5b7fa37bf8b64e7c935c4ff3389e36f8dd162f0705410dd719fd089e1ea253cd
Access-Control-Allow-Origin: *
Access-Control-Allow-Methods: GET, OPTIONS
Access-Control-Allow-Headers: DNT,User-Agent,X-Requested-With,If-Modified-Since,Cache-Control,Content-Type,Range
Access-Control-Expose-Headers: Content-Length,Content-Range