Load Drug Indication Database (CC0)

Looks like plenty of data cleaning will be needed.

number of DID_ID: 191111
number of unique raw drug names: 34137
number of unique umls preferred drug terms: 21807
number of raw predicates (not unique, not null): 140181
number of unique umls preferred indication terms: 6111
number of DID entries with a predicate value: 106762
number of DID entries where the predicate is a "marker/mechanism": 42411
number of entries with predicate values that aren't 'marker/mechanism': 62913
number of WD entities pulled by CAS number from DIDs with predicates: 9360
number of WD entities pulled by UMLS "drug" CUIS from DIDs with predicates: 2077
number of WD entities pulled by UMLS "phenotype" CUIS from DIDs with predicates: 1717
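For anyone wanting to reproduce counts of this kind, here is a minimal sketch using only the Python standard library. It is not the script that produced the numbers above. The column names (`did_id`, `raw_drug_name`, `umls_drug_term`, `predicate`, `umls_indication_term`) and the inline sample rows are assumptions made up for illustration; the real DID export may be laid out differently.

```python
import csv
import io

# Hypothetical sample in an assumed layout; the real DID headers may differ.
SAMPLE = """did_id,raw_drug_name,umls_drug_term,predicate,umls_indication_term
1,aspirin,Aspirin,treats,Pain
2,Aspirin ,Aspirin,marker/mechanism,Inflammation
3,ibuprofen,Ibuprofen,,Fever
4,ibuprofen,Ibuprofen,treats,Pain
"""


def summarize(rows):
    """Return summary counts for an iterable of dict rows."""
    rows = list(rows)
    # Entries whose predicate field is non-empty after trimming whitespace.
    with_pred = [r for r in rows if r["predicate"].strip()]
    marker = [r for r in with_pred
              if r["predicate"].strip() == "marker/mechanism"]
    return {
        "entries": len(rows),
        # Raw names are deliberately NOT normalized, so "aspirin" and
        # "Aspirin " count as distinct values.
        "unique_raw_drug_names": len({r["raw_drug_name"] for r in rows}),
        "unique_umls_drug_terms": len({r["umls_drug_term"] for r in rows}),
        "unique_umls_indication_terms": len(
            {r["umls_indication_term"] for r in rows}),
        "entries_with_predicate": len(with_pred),
        "marker_mechanism": len(marker),
        "other_predicates": len(with_pred) - len(marker),
    }


stats = summarize(csv.DictReader(io.StringIO(SAMPLE)))
for name, value in stats.items():
    print(f"{name}: {value}")
```

Leaving raw drug names unnormalized is intentional here: the gap between the raw-name count and the UMLS preferred-term count gives a rough sense of how much cleaning is involved.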

SuLab / GeneWikiCentral

Load Drug Indication Database (CC0) #113