Open wumirose opened 1 year ago
@wumirose can you provide a bit more context? Where did you find these identifiers and what is the effect of these not normalizing? That will help us prioritize this work.
I added the sources and a few more details. I hope it helps.
These ReferenceSequences are not normalizing
EntityWithAccessionedSequence has four types: Protein, Gene and Transcript, DNA Sequence, and RNA Sequence. All of them normalize (with UniProtKB, ENSEMBL:ENSG, .... prefixes) except RNASequence and a few DNASequence/Gene entries. We need to add support for the remaining unnormalized identifiers, which have ENSEMBL:ENSOCUP.., ENSEMBL:ENST.., RNAcentral:...., prefixes.
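To help size the gap, here is a minimal sketch of how the failing identifiers could be tallied by prefix family. It is not part of the normalizer: `prefix_bucket` and `count_buckets` are hypothetical helper names, and the identifiers in the usage lines are made-up placeholders rather than real accessions. It assumes identifiers are CURIEs of the form `PREFIX:LOCAL_ID`, and that ENSEMBL local IDs start with an uppercase letter code (ENSG, ENST, ENSOCUP, ...) followed by digits.

```python
import re
from collections import Counter


def prefix_bucket(curie: str) -> str:
    """Return a coarse prefix family for a CURIE (hypothetical helper).

    ENSEMBL IDs are split further by their leading letter code,
    e.g. ENSEMBL:ENSG, ENSEMBL:ENST, ENSEMBL:ENSOCUP.
    """
    prefix, _, local_id = curie.partition(":")
    if prefix == "ENSEMBL":
        match = re.match(r"[A-Z]+", local_id)
        if match:
            return f"{prefix}:{match.group(0)}"
    return prefix


def count_buckets(curies):
    """Count how many identifiers fall into each prefix family."""
    return Counter(prefix_bucket(c) for c in curies)


# Placeholder identifiers for illustration only (not real accessions).
unnormalized = [
    "ENSEMBL:ENSOCUP00000000001",
    "ENSEMBL:ENST00000000001",
    "ENSEMBL:ENST00000000002",
    "RNAcentral:URS0000000001",
]

for family, n in count_buckets(unnormalized).most_common():
    print(family, n)
```

Running something like this over the full list of identifiers that fail to normalize would show which prefix families account for most of them, which may help with prioritization.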