oushujun / EDTA

Extensive de-novo TE Annotator
https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1905-y
GNU General Public License v3.0
336 stars 73 forks source link

Whether curatedlib will be processed (remove redundancy and filter) before generating Final TE library used for annotation? #345

Closed Taoo777 closed 1 year ago

Taoo777 commented 1 year ago

Hi @oushujun! firstly, thank you for establishing EDTA pipeline, it's very useful for TEs annotation of non-model organism genome and has helped me a lot.

I have a small question about curatedlib processing. I wonder whether the curated library that I added will be processed together with RepeatModeler results to remove redundancy and filter and then generate the Final TE library used for annotation? Or does EDTA do nothing to input curated library and directly merge it with RepeatModeler results to generate Final TE library?

Looking forward to your reply!

oushujun commented 1 year ago

Hello,

Thank you for using EDTA. For the curated library, it will be directly merged into the final library, so be cautious about what you put in it. You will need to format the curated library as in the RepeatMasker library format to be recognized by EDTA/Repeatmasker. For the EDTA library or the RepeatModeler library, their redundancy will be removed by comparing to the curated library if provided.

Thanks, Shujun