Closed erikyao closed 1 year ago
Dec 13 decision with Chunlei:
Make an intermediate UMLS file containing (UMLSID, EntityName, SemanticTypeAbv, SemanticTypeFullName)
. May server as the source file for future UMLS endpoint.
Dec 14 decision with Colleen:
If a mapped new UMLS ID already appears in the SemMedDB predication CSV file, use its EntityName, SemanticTypeAbv, SemanticTypeFullName
before checking RRFs
Problem
When replacing a retired UMLS ID, its name, semantic type abbreviation/name should be replaced at the same time, but the retired CUI table, as in the
MRCUI.RFF
file of UMLS Metathesaurus, contains only UMLS IDs.E.g. in the source file of SemMedDB predications, the following record
according to
MRCUI.RFF
, should be replaced byBut
MRCUI.RFF
only tells youC0021311 => C0009450
replacement. The new "Name", "SemanticTypeAbv", and "SemanticTypeName" should be filled from other data sources.P.S. the fully replaced record should be like:
Solution
Step 1: UMLS ID => Subject/Object Name
Should be queryable in
MRCONSO.RRF
, file of Concept Names and Sources.However each UMLS ID might have multiple records. Inspired by Example 7 of UMLS Database Query Diagrams,
the filtering condition is
The explanation of other
TS
,STT
, andLAT
values can be found at Abbreviations Used in Data Elements - 2022AB Release. The meaning ofISPREF
is explained at Table 1, UMLS® Reference Manual.CUI names are recorded in the
STR
column.Step 2: UMLS ID => Semantic Type Name
Query
MRSTY.RRF
, file of Semantic Types.Step 3: Semantic Type Name => Semantic Type Abbreviation
Query Semantic Type Mappings.