oushujun / EDTA

Extensive de-novo TE Annotator
https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1905-y
GNU General Public License v3.0
330 stars 72 forks source link

questions about EDTA‘s result files #255

Closed tinyfallen closed 2 years ago

tinyfallen commented 2 years ago

Hi dear developers, Many thanks for your excellent tools! I have some questions.

  1. I noticed the outputs in the path labeled with raw and intact. Looking into the scripts, the *LTR.raw.fa seemed to be used in the following steps. What's the difference between raw.fa and intact.fa ?

image

  1. By running LTR_retriever separately I got a LTR lib differed from that of EDTA. I would like to use the scripts solo_finder.pl in LTR_retriever to obtain the solo LTR and intact LTR, but it seemed different between the LTR_retriever's *out file and the EDTA's version. Which should I trust?

Looking forward to your reply, Best !

oushujun commented 2 years ago

Hi @tinyhys,

LTR.raw.fa is the raw LTR library for EDTA, which is copied from LTRlib.fa generated by LTR_retriever. *LTR.intact.fa contains all intact LTR sequences, which is redundant.

solo_finder.pl works for LTR_retriever outputs and you may need to modify it to work on EDTA outputs.

Shujun

tinyfallen commented 2 years ago

Hi Shujun, Thanks to your reply and I have other questions wishing your comments. Do the solo LTRs still have the ability to transposite? And if some of them do, how can I determine whether a solo LTR remaining the transposition activity or not ? Many thanks!

oushujun commented 2 years ago

Hi @tinyhys,

Good question. The current understanding is not: Solo LTRs are incapable of transposition. But who knows. Since LTR elements are retrotransposons, they require reverse transcription to make new copies. For LTRs to be able to transpose, they would need to transcribe and make mRNAs. That means, to see if a solo LTR is transposable or not, the first evidence is detecting their mRNAs.

Best, Shujun

tinyfallen commented 2 years ago

Dear teacher, thanks a lot for your comment! it solves a concern on my analysis.