Closed ronaldtse closed 5 years ago
Please wait until I review the NIST JSON in #23 before proceeding with this.
Review in #23 done. There some non-critical information missing, but I think mapping this to Relaton XML is going to be straightforward. This can proceed.
@ronaldtse should we include the JSON file into this gem? If so would be the JSNO file updated from time to time?
UPD Oh, I see it's updated daily. How do we suppose to update the file locally? We can check if the creation date of the file is less than today then we load new file. In this case, we shouldn't save the file in the repository. Is this ok?
We should:
Cache the full JSON file globally
What do you mean?
Cache full JSON, for example, store the JSON in ~/.relaton/nist/csrc.json
, so subsequent fetches in the same day won't need to re-download.
@ronaldtse in the JSON file we don't have:
Is it ok if we don't have this information in relaton XML?
Yes, with the URI and DOI we don’t need the PDF link for now.
For abstract and history, they may be added later to this JSON. Thanks!
@ronaldtse how should we map parts of name from JSON to bib model?
title => prefix
givenName => forename
middlename => ?
surname => surname
suffix => addition
nickname => ?
@andrew2net currently we don't have a superset Relaton model. We should somehow preserve this data for when Relaton "upgrades" its way to handle contributor information properly.
Thanks to the excellent work at the CSRC, they have a new bibliographic feed made for Metanorma in the Relaton-like format. This feed is updated daily.
The feed content also differs from the CSRC search site -- it is a full superset of it -- e.g. it provides all drafts with the correct statuses.
Metadata
https://csrc.nist.gov/CSRC/media/feeds/metanorma/pubs-export.meta
This describes attributes of the ZIP file.
JSON data
https://csrc.nist.gov/CSRC/media/feeds/metanorma/pubs-export.zip
This is the JSON file that contains bibliographic data of all NIST SPs and FIPSs, at only 793K it is way better than just scraping the search site.
Here's a sample entry.
Actions
Thanks!