icgc-argo / argo-dictionary

Development of the ARGO Data Dictionary
https://docs.icgc-argo.org/dictionary
GNU Affero General Public License v3.0
2 stars 1 forks source link

Investigate submission of investigational drugs which do not exist in RxNorm #367

Open hknahal opened 1 year ago

hknahal commented 1 year ago

Example: "Dovitinib" and "Celastrol" do not exist in RxNorm.

Investigate:

hknahal commented 1 year ago
Proposed changes that were implemented in MOH data model as well. Field name Data Tier Attribute Type Description Permissible Values
drug_reference_database Core Required Indicate the reference drug database where drug name is found. Text RxNorm, PubChem, NCI Thesaurus
drug_name Core Required Indicate the drug name as it exists in the database specified in the drug_reference_database. Text
drug_reference_identifier Core Required Indicate the identifier from the "drug_reference_database" for the drug. Text

Having generalized fields that are not specific to just RxNorm enable addition of new drug reference databases in the future, should they be required.

Similar to RxNorm, we can use an API for PubChem and NCI Thesaurus to validate the submitted "drug_name" and "drug_reference_identifier" fields.

To validate drug_name and drug_reference_identifier from PubChem:

Example: Field name Value Validation
drug_reference_database PubChem
drug_name Celastrol https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/name/Celastrol/cids
drug_reference_identifier 122724 https://pubchem.ncbi.nlm.nih.gov/compound/122724

To validate drug_name and drug_reference_identifier from NCI Thesaurus:

Example: Field name Value
drug_reference_database NCI Thesaurus
drug_name Allogeneic iC9/CAR19/IL15-transduced CB-NK Cells
drug_reference_identifier C148215

Search "code" column for drug_reference_identifier" and check if "drug_name" matches any of the terms that exist in the "synonym" column.

edsu7 commented 1 week ago

Potential subtickets

Potentially work for DR11