Open dhimmel opened 1 year ago
I was once told by the MeSH people that you can use a regex to figure out whether xrefs point to CAS or UNII.
@dhimmel pyobo now implements related logic in https://github.com/pyobo/pyobo/blob/8fc402dcfcd089d6e90c2a6c4a4b6a71629d3a33/src/pyobo/sources/mesh.py#L234-L257
MeSH includes some external mappings via the following predicates (from docs):
meshv:registryNumber
: A property of Concepts. A unique identifier from one of these sources: Enzyme Commission (Example: EC 2.4.2.17; Example for Partial enzyme number: EC 1.4.3.-); Chemical Abstracts Service (CAS) (Example: 7004-12-8); FDA Substance Registration System Unique Identifier (UNII) in 10-character format (Example: R16CO5Y76E); or the value of 0 if no match is available from the previous sources. A single MeSH Concept can only have one Registry Number. Used for Concepts related to Descriptors in the D Category Drugs and Chemicals and for SupplementaryConceptRecords. MUI M0000115 example: 362O9ITL9D.

meshv:relatedRegistryNumber
: A property of Concepts. An additional unique identifier for chemicals, which is sometimes followed by a label in parentheses. Multiple Related Registry Numbers are allowed for each Concept. For example, these might be salts and/or stereoisomers of the parent compound. Used for Concepts related to Descriptors in the D Category Drugs and Chemicals and for SupplementaryConceptRecords. MUI M0000115 example: 103-90-2 (Acetaminophen). MUI M0068239 example: 75821-71-5 (Ca salt).

meshv:casn1_label
: A property of Concepts. Free-text of the Chemical Abstracts Type N1 Name which is the systematic name used in the Chemical Abstracts Chemical Substance and Formula Indexes. The systematic name is a unique name assigned to a chemical substance to represent its structure. First available in 1995. MUI M0000115 example: Acetamide, N-(4-hydroxyphenyl)-

Here's a query to access these:
One challenge is that registry numbers appear to be local identifiers without any notation of their source.
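Since the source has to be inferred from the shape of the value, here is a minimal sketch of the regex approach mentioned earlier in the thread. It is not the pyobo implementation linked above. The patterns are assumptions derived only from the examples in the MeSH docs quoted here, the function names (`split_label`, `cas_checksum_ok`, `classify_registry_number`) and the returned labels (`"ec"`, `"cas"`, `"unii"`) are hypothetical, and the CAS check-digit test is an extra safeguard I added on top of the regex.

```python
import re

# Patterns inferred from the examples in the MeSH documentation quoted above.
# They are assumptions for illustration, not an official specification.
EC_RE = re.compile(r"^EC \d+(?:\.(?:\d+|-)){3}$")   # e.g. EC 2.4.2.17, EC 1.4.3.-
CAS_RE = re.compile(r"^\d{2,7}-\d{2}-\d$")          # e.g. 7004-12-8
UNII_RE = re.compile(r"^[0-9A-Z]{10}$")             # e.g. R16CO5Y76E


def split_label(value):
    """Split a value such as '103-90-2 (Acetaminophen)' into (identifier, label).

    Related registry numbers are sometimes followed by a label in parentheses;
    the label is None when no such suffix is present.
    """
    value = value.strip()
    if value.endswith(")") and " (" in value:
        identifier, _, label = value.partition(" (")
        return identifier.strip(), label[:-1]
    return value, None


def cas_checksum_ok(cas):
    """Check the CAS check digit: weight the digits right-to-left by 1, 2, 3, ..."""
    digits = cas.replace("-", "")
    body, check = digits[:-1], int(digits[-1])
    total = sum(i * int(d) for i, d in enumerate(reversed(body), start=1))
    return total % 10 == check


def classify_registry_number(value):
    """Guess the source of a MeSH registry number; return None if unknown or '0'."""
    identifier, _label = split_label(value)
    if identifier == "0":  # MeSH uses 0 when no source identifier is available
        return None
    if EC_RE.match(identifier):
        return "ec"
    if CAS_RE.match(identifier) and cas_checksum_ok(identifier):
        return "cas"
    if UNII_RE.match(identifier):
        return "unii"
    return None
```

For example, `classify_registry_number("103-90-2 (Acetaminophen)")` would classify the identifier part as CAS after stripping the label. Values that match none of the patterns fall through to `None`, so they can be collected and reviewed by hand rather than silently assigned to a source.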