NCIP / clinical-trials-search

Biomarker synonyms are not always parsed correctly. #5

Closed dkrylovsb closed 8 years ago

dkrylovsb commented 8 years ago

From QA Engineer:

Some of the biomarker names inside brackets are not displayed properly. They appear under the synonyms category in the JSON file, but for some trials that list does not include all of the names. For trial NCI-2015-00596, the name in PA is displayed as ROS1 (c-Ros Tyrosine Kinase; Proto-Oncogene Tyrosine-Protein Kinase ROS; v-ros avian UR2 sarcoma virus oncogene homolog 1; ROS; c-ros-1; v-ros UR2 sarcoma virus oncogene homolog 1 (avian); MCF3; gp280; c-Ros Receptor Tyrosine Kinase; receptor tyrosine kinase; c-ros oncogene 1)

but in the JSON it is displayed as

{ "assay_purpose": "Eligibility Criterion - Exclusion", "long_name": "c-Ros Tyrosine Kinase", "name": "ROS1", "synonyms": [ "c-Ros Tyrosine Kinase", "Proto-Oncogene Tyrosine-Protein Kinase ROS", "v-ros avian UR2 sarcoma virus oncogene homolog 1", "c-ros-1", "ROS", "MCF3", "v-ros UR2 sarcoma virus oncogene homolog 1 " ], "hugo_biomarker_code": null }

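So the JSON file does not include all of the synonyms: gp280, c-Ros Receptor Tyrosine Kinase, receptor tyrosine kinase, and c-ros oncogene 1 are missing, and "v-ros UR2 sarcoma virus oncogene homolog 1 (avian)" is cut off before "(avian)". [See attached screenshot]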

Similarly, for NCI-2016-01375 the name in PA is displayed as Vitamin D (25(OH)D; 25 OHD; 25-hydroxy vitamin D), but in the JSON file it is displayed as

"biomarkers": [
{ "assay_purpose": "Eligibility Criterion - Inclusion", "long_name": "25-Hydroxyvitamin D", "name": "Vitamin D", "synonyms": [ "25" ], "hugo_biomarker_code": null }

],
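
Both outputs above look like what you would get if the synonym string were split on every parenthesis instead of only the outermost pair, though that is a guess from the data and not something confirmed against the project's code. As a rough sketch of the expected behavior, assuming the PA display format is always `NAME (syn1; syn2; ...)`, the following Python splits on semicolons only at nesting depth zero. The function name `split_biomarker_name` is hypothetical and is not part of this repository.

```python
def split_biomarker_name(display_name: str) -> tuple[str, list[str]]:
    """Split 'NAME (syn1; syn2; ...)' into (name, synonyms).

    Hypothetical helper: assumes synonyms sit in one outer pair of
    parentheses and are separated by semicolons.
    """
    text = display_name.strip()
    start = text.find("(")
    # No trailing synonym group: treat the whole string as the name.
    if start == -1 or not text.endswith(")"):
        return text, []

    name = text[:start].strip()
    body = text[start + 1:-1]  # contents of the outermost parentheses

    synonyms: list[str] = []
    current: list[str] = []
    depth = 0  # nesting level of parentheses inside the synonym group
    for ch in body:
        if ch == "(":
            depth += 1
        elif ch == ")":
            depth -= 1
        if ch == ";" and depth == 0:
            # A top-level semicolon ends the current synonym.
            synonyms.append("".join(current).strip())
            current = []
        else:
            current.append(ch)
    synonyms.append("".join(current).strip())

    # Drop empty entries produced by stray separators.
    return name, [s for s in synonyms if s]


name, synonyms = split_biomarker_name(
    "Vitamin D (25(OH)D; 25 OHD; 25-hydroxy vitamin D)"
)
print(name, synonyms)
```

With this approach the nested parentheses in "25(OH)D" and in "v-ros UR2 sarcoma virus oncogene homolog 1 (avian)" stay inside their synonym instead of ending the list early.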