outbreak-info / outbreak.info-resources

A curated repository of metadata of resources on COVID-19 and SARS-CoV-2
MIT License

[DATASET] Zenodo id resolution failure #179

Closed by gtsueng 2 years ago

gtsueng commented 3 years ago

Zenodo issues a different record ID for each version of a record, but our API appears to provide only the ID of the latest version. This means that if you pull Zenodo records from our API at one point in time, many of those record IDs will stop being usable later, once a newer version replaces them and only the latest ID is available.

Example: zenodo.3855135 was pulled from our API in an old dump of the datasets available via outbreak.info/resources. This ID no longer resolves to anything, even though the entry still exists in our API as record zenodo.3898626.

Zenodo lets these versioned records map to an 'all versions' (non-versioned) record (https://zenodo.org/record/3855135). It would be nice if the API could do something similar instead of just returning nothing.
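
To make the request concrete, here is a rough sketch of the kind of fallback I have in mind. It is not how our API or the crawler works today: the field names `latest_id` and `previous_ids` and both helper functions are hypothetical, and the only real values are the two IDs from the example above.

```python
from typing import Dict, Iterable, Optional


def build_version_index(records: Iterable[dict]) -> Dict[str, str]:
    """Map every known ID (latest or superseded) to the latest ID.

    Each record is assumed to carry a hypothetical `latest_id` string and
    an optional hypothetical `previous_ids` list of superseded IDs.
    """
    index: Dict[str, str] = {}
    for record in records:
        latest = record["latest_id"]
        index[latest] = latest
        for old_id in record.get("previous_ids", []):
            index[old_id] = latest
    return index


def resolve(index: Dict[str, str], requested_id: str) -> Optional[str]:
    """Return the latest ID for a requested ID, or None if it is unknown."""
    return index.get(requested_id)


# The ID pair comes from the example in this issue; the record layout is assumed.
records = [
    {"latest_id": "zenodo.3898626", "previous_ids": ["zenodo.3855135"]},
]
index = build_version_index(records)
print(resolve(index, "zenodo.3855135"))
```

With something like this, a lookup for the old ID would return the current record rather than nothing, and an ID that was never seen would still come back empty.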

gtsueng commented 2 years ago

Should have been resolved by https://github.com/biothings/biothings.crawler/pull/11