Open dhimmel opened 3 years ago
This issue is introduced when we're integrating Metazoa Species data from Ensembl through BioMart.
File path: ensembl_metazoa/49/gene_ensemblgenemain.txt text based search: awk '$2 == "263" { print $0 }' gene_ensemblgenemain.txt returns: 27923 263 rns 3153 3520 Mt 1 rRNA
And since no entrezgene id can be mapped to it. We use it as the _id. And it accidentally aligns with the genedoc with _id:263 from entrez for human species.
I've noticed three genes where the value for
ensembl.gene
does not begin withENSG
:In these cases, it seems the value for
ensembl.gene
has been set toentrezgene
(the ncbigene id). Any ideas what the problem is?