AlicePsyche closed this 5 years ago
Hi Alice,
One filter we apply is to remove structural RNAs (e.g. rRNA, scRNA, snRNA, srpRNA, tRNA) and simple repeat/low complexity regions (e.g. (TG)n repeats), as we do not consider them transposable elements. I think that accounts for the difference in numbers. A parsed version of the TE GTF for danRer7 (using your file as input) is available here. Let me know if you are still encountering issues with reformatting or with the provided GTF. Thanks.
Cheers, Oliver
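For anyone who wants to reproduce this kind of filtering on another genome, here is a minimal Python sketch of the idea described above. It is not the maintainers' actual script. It assumes the standard 17-column UCSC rmsk.txt layout, it excludes only the classes named in the reply (that list was given with "e.g.", so the real filter may drop more), and the GTF attribute names `family_id` and `class_id`, the function names, and the example rows are all hypothetical choices made for illustration.

```python
import io

# Column order of the UCSC rmsk table (17 tab-separated fields).
RMSK_COLUMNS = [
    "bin", "swScore", "milliDiv", "milliDel", "milliIns",
    "genoName", "genoStart", "genoEnd", "genoLeft", "strand",
    "repName", "repClass", "repFamily", "repStart", "repEnd",
    "repLeft", "id",
]

# Assumption: only the classes named in the reply above are excluded.
EXCLUDED_CLASSES = {
    "rRNA", "scRNA", "snRNA", "srpRNA", "tRNA",
    "Simple_repeat", "Low_complexity",
}


def iter_te_records(handle, excluded=EXCLUDED_CLASSES):
    """Yield rmsk rows as dicts, skipping rows whose repClass is excluded."""
    for line in handle:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != len(RMSK_COLUMNS):
            continue  # skip malformed rows rather than guess at their layout
        record = dict(zip(RMSK_COLUMNS, fields))
        if record["repClass"] in excluded:
            continue
        yield record


def to_gtf_line(record, index, source="rmsk"):
    """Format one record as a GTF line (attribute layout is an assumption)."""
    # UCSC coordinates are 0-based, half-open; GTF is 1-based, inclusive.
    start = int(record["genoStart"]) + 1
    end = int(record["genoEnd"])
    attributes = (
        f'gene_id "{record["repName"]}"; '
        f'transcript_id "{record["repName"]}_{index}"; '
        f'family_id "{record["repFamily"]}"; '
        f'class_id "{record["repClass"]}";'
    )
    return "\t".join([
        record["genoName"], source, "exon", str(start), str(end),
        record["swScore"], record["strand"], ".", attributes,
    ])


# Made-up rows for illustration only (not real danRer7 annotations).
EXAMPLE = "\n".join([
    "585\t1000\t100\t0\t0\tchr1\t100\t400\t-1000\t+\tDNA-8-1_DR\tDNA\thAT-Ac\t1\t300\t0\t1",
    "585\t30\t0\t0\t0\tchr1\t500\t540\t-900\t+\t(TG)n\tSimple_repeat\tSimple_repeat\t1\t40\t0\t2",
    "585\t200\t50\t0\t0\tchr1\t600\t680\t-800\t-\ttRNA-Ala-GCA\ttRNA\ttRNA\t1\t80\t0\t3",
])

if __name__ == "__main__":
    kept = list(iter_te_records(io.StringIO(EXAMPLE)))
    for i, rec in enumerate(kept, start=1):
        print(to_gtf_line(rec, i))
```

To run it on a real table, pass an open file handle for the decompressed rmsk.txt instead of the `io.StringIO` example. Comparing the number of kept records with the line count of the raw table gives a quick check on how much of the difference the class filter explains.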
Many thanks for the prompt reply and the ready-to-use GTF file! Really helps me a lot.
Do any of these filtered TE GTFs exist? All the links fail for me.
Hi,
Yes, they exist. Unfortunately, our server might be down right now. We'll post here again once they are up.
Thanks.
Hi,
The server is back up, and the files should now be accessible.
Thanks for letting us know.
Hey, the files are inaccessible again. Could anyone help? I need the annotation for mouse TEs.
Hi,
Sorry for the interruption. Is there any guideline for reformatting a TE GTF file downloaded from UCSC? I found the discussions here and here, but they do not seem to give the full rules. For example, for danRer10 there are 3565006 lines in the rmsk.txt table from UCSC but only 2548675 in your provided GTF file. So what should I do if I want to process the TE GTF file for another organism? (I have to use danRer7; the file is here.)
Thanks for your help in advance!
Best, Alice