biothings / semmeddb

1 stars 1 forks source link

Piped CUIs in `semmedVER43_2022_R_PREDICATION.csv` #3

Closed erikyao closed 1 year ago

erikyao commented 2 years ago

In semmedVER43_2022_R_PREDICATION.csv, there exist SUBJECT_CUI or SUBJECT_CUI values like C1419460|6060|26822. We call such values "piped CUIs".

Within piped subject CUIs,

count of pipes in a CUI count of CUIs
1 31404
2 2562
3 795
4 413
5 252
6 122
7 82
8 46
9 23
10 14
11 2
12 6
13 175
36 1

Within piped object CUIs,

count of pipes in a CUI count of CUIs
1 29273
2 2306
3 757
4 383
5 223
6 120
7 68
8 37
9 22
10 12
11 2
12 7
13 149
36 1

Also note that such piped CUIs may contain Entrez IDs only. E.g. 8536|57118, which appears in the following rows.

PMID PREDICATE SUBJECT_CUI SUBJECT_NAME SUBJECT_SEMTYPE SUBJECT_NOVELTY OBJECT_CUI OBJECT_NAME OBJECT_SEMTYPE OBJECT_NOVELTY
24626179 NEG_CAUSES 8536|57118 CAMK1|CAMK1D gngm 1 C1160474 egg activation celf 1
24825433 AUGMENTS 8536|57118 CAMK1|CAMK1D gngm 1 C0034850 Endosomes celc 1
12967475 ASSOCIATED_WITH 8536|57118 CAMK1|CAMK1D gngm 1 C0024301 Lymphoma, Follicular neop 1
18620088 INTERACTS_WITH 8536|57118 CAMK1|CAMK1D gngm 1 C0029418 Osteoblasts cell 1
18620088 INTERACTS_WITH 8536|57118 CAMK1|CAMK1D gngm 1 C0085301|2353 Proto-Oncogene Proteins c-fos|FOS aapp 1
18620088 AFFECTS 8536|57118 CAMK1|CAMK1D aapp 1 C1266909 Entire bony skeleton bdsy 1
22903836 AFFECTS 8536|57118 CAMK1|CAMK1D gngm 1 C0795806 chromosome 3p deletion syndrome dsyn 1
28805296 PART_OF 8536|57118 CAMK1|CAMK1D gngm 1 C0015576 Family humn 1
29876008 INTERACTS_WITH 8536|57118 CAMK1|CAMK1D gngm 1 C0017725 Glucose bacs 1
29876008 INTERACTS_WITH 8536|57118 CAMK1|CAMK1D gngm 1 C0021641 Insulin phsu 1
29165597 INHIBITS 8536|57118 CAMK1|CAMK1D gngm 1 C1415328|2938 GSTA1 gene|GSTA1 gngm 1

3 Cases: