Closed stevekm closed 7 years ago
Further parsing down the Excel table may not be helpful since the tumor and tissue type information is still not available for matching with interpretations. Once this information is available, it may be possible to still successfully parse the table for matching entries without further parsing needed.
In order to integrate clinical interpretations of variants from the WC PMKB database (here: https://pmkb.weill.cornell.edu/therapies/download.xlsx , https://pmkb.weill.cornell.edu/), the provided Excel sheet needs to be parsed to make searching for variants easier. Current scripts only provide TSV output format, but scripts need to be extended to split the dataset so each individiual variant has a single row, and the variant ID's match ANNOVAR output format.