mskcc / vcf2maf

Convert a VCF into a MAF, where each variant is annotated to only one of all possible gene isoforms
Other
371 stars 216 forks source link

ensg_to_entrez_id_map_ensembl_feb2014.tsv erroneously assigns Entrez ID #341

Open gjoseph15 opened 1 year ago

gjoseph15 commented 1 year ago

we are using vcf2maf prior to import to our local cBioPortal instance. We encounter a warning from cBioPortal's data validator that the MAF data contains Hugo Symbol:Entrez ID combinations that are incorrect.

Upon deeper inspection, the "ensg_to_entrez_id_map_ensembl_feb2014.tsv" file contains multiple instances where several Entrez ID's trace back to the same ENSG#, but no additional columns exist in the file to successfully pair the Hugo symbol to the specific Entrez from the cluster and so the pipeline incorrectly assigns the values.