Closed (marc-sturm closed this issue 4 years ago)
Sorry for the delay in responding, and thanks for pointing this out. Our build system uses external sources from the NCBI (medgen) to associate genes with diseases. We have also occasionally noticed errors and reported them to NCBI medgen, who usually fix them promptly. I have checked some of the errors listed above, and some but not all seem to be resolved now. Could I ask you to report errors here? https://www.ncbi.nlm.nih.gov/medgen/docs/help/
Hi,
the problem still exists:
COX1 ERROR: COX1 is a synonymous symbol of the genes MT-CO1, PTGS1
COX2 ERROR: COX2 is a synonymous symbol of the genes MT-CO2, PTGS2
H19-ICR ERROR: H19-ICR is unknown symbol
HBB-LCR ERROR: HBB-LCR is unknown symbol
ND1 ERROR: ND1 is a synonymous symbol of the genes IVNS1ABP, MT-ND1
TRNP ERROR: TRNP is a synonymous symbol of the genes MT-TP, TRNP1
WHCR ERROR: WHCR is unknown symbol
I'm really not sure what to report to NCBI medgen, since I don't use it myself and cannot even give them an example of an API query that gives invalid results. I think you should report back to them, also because a report from you would carry more weight.
Best, Marc
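The errors listed in this comment fall into two classes: symbols that are synonyms of more than one gene (so no unique conversion exists) and symbols that are not known at all. A minimal Python sketch of that distinction follows. It assumes a toy lookup table built only from the symbols quoted in this thread; the names `APPROVED`, `SYNONYMS` and `classify_symbol` are hypothetical, and this is not how GenesToApproved is implemented, nor is the table real HGNC data.

```python
# Toy tables built only from the symbols quoted in this thread.
# Assumption: these stand in for a real HGNC alias table; they are not HGNC data.
APPROVED = {
    "MT-CO1", "PTGS1", "MT-CO2", "PTGS2",
    "IVNS1ABP", "MT-ND1", "MT-TP", "TRNP1",
}

# Synonym -> list of genes that share this synonym.
SYNONYMS = {
    "COX1": ["MT-CO1", "PTGS1"],
    "COX2": ["MT-CO2", "PTGS2"],
    "ND1": ["IVNS1ABP", "MT-ND1"],
    "TRNP": ["MT-TP", "TRNP1"],
}


def classify_symbol(symbol):
    """Return (status, candidates) for a gene symbol (hypothetical helper)."""
    if symbol in APPROVED:
        return ("approved", [symbol])
    candidates = SYNONYMS.get(symbol, [])
    if len(candidates) == 1:
        # Exactly one gene uses this synonym, so conversion is unambiguous.
        return ("converted", candidates)
    if len(candidates) > 1:
        # Several genes share the synonym, so no unique conversion exists.
        return ("ambiguous", candidates)
    return ("unknown", [])


if __name__ == "__main__":
    for s in ["COX1", "H19-ICR", "PTGS1"]:
        print(s, classify_symbol(s))
```

In this sketch an ambiguous symbol is reported together with its candidate genes rather than silently mapped to one of them, which mirrors the "synonymous symbol of the genes ..." messages above.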
@iimpulse thanks, Marc, we will try to track this down and we will report this to NCBI as appropriate.
I have reported this to medgen. This is all we can do at the moment. Assuming they correct their files, our downstream files will reflect the information in the next release.
Hi,
We noticed that some of the gene names listed in the artefacts are outdated.
Some are just previous symbols, which is no big problem (the gene name listed in HPO is the second gene name; the first gene name is the HGNC-approved name):
However, some gene names cannot be converted to HGNC-approved names, which makes the information hard to use:
The output shown here was created using this command:
GenesToApproved is part of ngs-bits
Best, Marc