Closed. cthoyt closed this issue 8 months ago
Thanks for the issue @cthoyt - yes, this is what I am working on now, in light of the reluctance to add a "preferred" prefix in bioregistry. I think using the synonyms will most often work just as well. :) I can theoretically imagine use cases where having synonyms available in the prefixmap will cause problems downstream in our KGs (e.g. we need validation to prevent a node identified as PMID:123456 from duplicating a node identified as Pubmed:123456).
@sierra-moxon in this case, you simply use `Converter.standardize_curie`, which already implements the logic for making sure you don't get multiple equivalent CURIEs based on the synonyms inside the converter.
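To make the idea in the comment above concrete, here is a minimal, self-contained sketch of synonym-based CURIE standardization and how it avoids duplicate nodes. It is not the `curies` implementation of `Converter.standardize_curie`. The function names, the choice of `pubmed` as the preferred prefix, and the synonym table are all assumptions for illustration, with the synonyms taken from this thread.

```python
# Simplified stand-in for synonym-aware CURIE standardization.
# NOT the curies implementation; all names and data here are illustrative.

# Assumed preferred prefix mapped to synonyms mentioned in this thread.
PREFIX_SYNONYMS = {
    "pubmed": ["PubMed", "Pubmed", "PUBMED", "pmid", "PMID", "MEDLINE"],
}

# Reverse index: each synonym (and the preferred prefix itself) -> preferred prefix.
SYNONYM_TO_PREFERRED = {
    synonym: preferred
    for preferred, synonyms in PREFIX_SYNONYMS.items()
    for synonym in [preferred, *synonyms]
}


def standardize_curie(curie):
    """Rewrite a CURIE with its preferred prefix; return None if the prefix is unknown."""
    prefix, sep, identifier = curie.partition(":")
    if not sep:
        return None  # not a CURIE at all
    preferred = SYNONYM_TO_PREFERRED.get(prefix)
    if preferred is None:
        return None
    return f"{preferred}:{identifier}"


def deduplicate_nodes(curies):
    """Collapse CURIEs that differ only by prefix synonym, keeping first-seen order."""
    seen = {}
    for curie in curies:
        standard = standardize_curie(curie)
        # Fall back to the raw CURIE when its prefix is not recognized.
        key = standard if standard is not None else curie
        seen.setdefault(key, None)
    return list(seen)


print(deduplicate_nodes(["PMID:123456", "Pubmed:123456"]))
```

With this kind of normalization applied before nodes are written to a KG, the two identifiers from the example collapse into a single node instead of two.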
Would it be possible to propagate the CURIE prefix synonyms from the various sources in the merged context? I had data that was annotated with synonyms, but I wasn't able to use the converter that came out of `prefixmaps` to handle them.

Here's an example to illustrate. The Bioregistry lists `pubmed`, `PubMed`, `pmid`, `PMID`, and `MEDLINE` as synonyms for PubMed, but only the uppercased `PUBMED` appears to work.