JBerthelier / PiRATE

PiRATE (Pipeline to Retrieve and Annotate Transposable Elements)
http://doi.org/10.17882/51795
18 stars 5 forks source link

Single Transposable elements library #42

Closed Lukanyo12 closed 3 years ago

Lukanyo12 commented 3 years ago

Hello Berthelier Forgive me for dozens of questions, It's just that I want to make use of this tool as much as I can. Since you were interested in potentially autonomous TEs in your study, and you focused on them mostly. What if for example in my case, my goal is to identify, classify and annotated TEs in genomes of interest. After classification, I want to group my elements as "class I retrotransposons" "class II DNA transposons" and "un-categorized elements" . Following classification, I wish to cluster the TE consensus sequences into families using MCL. Then, create a TE library for respective organism and perform a genome-wide annotation using TEannot pipeline.

JBerthelier commented 3 years ago

Dear Lukanyo12,

Yes this is good, you can build a single TE library for each respective organisms, but I would advise you to only use the classified Class I and Class II in your libraries. I would not use the unclassified, because it could be composed by a large amount of false positives.

Best regards,

Jeremy

Lukanyo12 commented 3 years ago

Thank you! Just a follow up question. how can I determine then the content of for example Class I TEs, Classs II TEs, unknown elements and Total TEs content?