Open matentzn opened 3 years ago
Hmm. Taking one of these at random, 1-hexacosanol:
with my older slurp from the FTP site, I get
[Term]
id: UMLS:C0080483
name: 1-hexacosanol
xref: MEDGEN:42607
xref: MESH:C051942
subset: Organic-Chemical
subset: Pharmacologic-Substance
synonym: "1-hexacosanol" RELATED [MSH:C051942]
synonym: "n-hexacosanol" RELATED [MSH:C051942]
synonym: "hexacosyl alcohol" RELATED [MSH:C051942]
relationship: RB UMLS:C1563649 {source="MSH"} ! 1-hexacosanol, aluminum (1:3) salt
Any strategy to debug? I have never touched the Medgen slurp code.
I agree this term is there. But check this:
matentzn@mbp:~/ws/mondo/src/ontology (master) $ grep 'C0080483' sources/medgen/medgen.obo
xref: MEDGEN:C0080483
id: UMLS:C0080483
relationship: RN UMLS:C0080483 {source="MSH"} ! 1-hexacosanol
matentzn@mbp:~/ws/mondo/src/ontology (master) $ grep '1-hexacosanol' sources/medgen/medgen.obo
id: UMLS:1-hexacosanol
name: 1-hexacosanol
synonym: "1-hexacosanol" RELATED [MSH:C051942]
relationship: RB UMLS:C1563649 {source="MSH"} ! 1-hexacosanol, aluminum (1:3) salt
name: 1-hexacosanol, aluminum (1:3) salt
synonym: "1-hexacosanol, aluminum (1:3) salt" RELATED [MSH:C051942]
relationship: RN UMLS:C0080483 {source="MSH"} ! 1-hexacosanol
Both seem to be there!
What is the source of the slurp from medgen? The docsums or files on the ftp site? Just wondering what NCBI can do to help.
The FTP files.
Results in ids like:
There are about 320K correct IDs in Medgen and about 2,000 such cases. Not sure where the fault lies; maybe with us?
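One way to narrow this down might be to list every `id:` line in medgen.obo that does not look like a concept ID. Below is a minimal sketch, not the actual slurp code. It assumes that a well-formed ID is `UMLS:` followed by `C` or `CN` and digits (as in `UMLS:C0080483`); that pattern and the function name `find_malformed_ids` are assumptions made for illustration.

```python
import re
import sys

# Assumption: well-formed IDs look like UMLS:C0080483 (or UMLS:CN...).
# Adjust the pattern if Medgen uses other legitimate ID shapes.
VALID_ID = re.compile(r"^UMLS:CN?\d+$")


def find_malformed_ids(lines):
    """Return (line_number, id) pairs for `id:` lines that fail VALID_ID."""
    bad = []
    for number, line in enumerate(lines, start=1):
        line = line.strip()
        if line.startswith("id: "):
            term_id = line[len("id: "):]
            if not VALID_ID.match(term_id):
                bad.append((number, term_id))
    return bad


if __name__ == "__main__":
    # Usage: python check_ids.py sources/medgen/medgen.obo
    with open(sys.argv[1], encoding="utf-8") as handle:
        malformed = find_malformed_ids(handle)
    for number, term_id in malformed:
        print(f"{number}\t{term_id}")
    print(f"{len(malformed)} malformed ids", file=sys.stderr)
```

Running the same check against the raw FTP input that the slurp reads (once mapped to the relevant column) could show whether the name-as-ID values are already present upstream or are introduced during conversion.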