Closed terrycojones closed 6 years ago
This seems to only be an issue with the protein version of RVDB. I'll ask them. Feel free to close this.
I've mailed Marc & Thomas about this. Closing.
They are endogenous (retro)virus (EN(R)V) sequences existing in their host's genome. We are working on the solutions to determine the sequences belong to host or EN(R)V for those particular cases.
Thanks @ArifaKhanLab !
Hi. First of all, thanks so much for all this work, it looks like a very promising middle ground between the small selection of the refseq database and the unruly enormous
nt
database.I have found a couple of cases of what should be virus names (in
[
...]
at the end of the sequence ids) that are actually species names.and the same occurs for a
grep
on[Homo sapiens]
, though with many more hits:Regards, Terry