Open amhanson9 opened 1 year ago
From reviewing 2023 accessioning data, PDF/A is common enough that it may be worth extracting the version from that pattern as well. The pattern is: Portable Document Format/Archiving (PDF/A-VERSION) OPTIONAL-TEXT
As we add new patterns for extracting the version from NARA format names, can we also use these patterns to combine the FITS name and version, to help with those matches?
Per a May 2023 conversation with NARA, they plan to split the version number into a separate column within the next 6 months, so this may no longer be necessary.
Location: match_nara_risk()
Description: Currently, anything after the last space in the NARA Format Name is used as the version. There are additional formats with versions that do not match this pattern. In two of them, the version is still after the last space but has additional characters to remove, which might be easy to implement: "name (version)" and "name v.version".
Priority: waiting to see how often formats with this pattern are in our accessions.
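
A rough sketch of how the extra patterns could be handled, in case it helps when this gets picked up. This is not the current code in match_nara_risk(): the function name `split_name_version`, the regex constants, and the example format names are all hypothetical, and the fallback only approximates the "anything after the last space" behavior described above.

```python
import re

# Hypothetical patterns (not from the existing script), tried in this order.
# "Portable Document Format/Archiving (PDF/A-VERSION) OPTIONAL-TEXT"
PDFA = re.compile(r"^(?P<name>.+?) \(PDF/A-(?P<version>[^)]+)\)(?: .*)?$")
# "name (version)"
PARENS = re.compile(r"^(?P<name>.+) \((?P<version>[^()]+)\)$")
# "name v.version"
V_DOT = re.compile(r"^(?P<name>.+) v\.(?P<version>\S+)$")


def split_name_version(format_name):
    """Return (name, version) from a NARA Format Name string.

    Assumption: the special patterns take priority, and anything else
    falls back to treating the text after the last space as the version.
    """
    for pattern in (PDFA, PARENS, V_DOT):
        match = pattern.match(format_name)
        if match:
            return match.group("name"), match.group("version")

    # Fallback: text after the last space is the version.
    name, sep, version = format_name.rpartition(" ")
    if sep:
        return name, version
    # No space at all, so there is nothing to treat as a version.
    return format_name, ""


# Made-up format names, for illustration only.
for example in [
    "Portable Document Format/Archiving (PDF/A-1a) accessible",
    "Example Format (2.0)",
    "Example Format v.3",
    "Example Format 1.1",
]:
    print(split_name_version(example))
```

The same function could be run on a combined FITS name and version string if we decide to build those in the NARA patterns, but that part is untested. Note that the fallback keeps the existing limitation: a name with no version still has its last word treated as the version.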