oushujun / LTR_retriever

LTR_retriever is a highly accurate and sensitive program for identification of LTR retrotransposons; The LTR Assembly Index (LAI) is also included in this package.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813529/
GNU General Public License v3.0
177 stars 40 forks source link

The total number of LTR in Arabidopsis #60

Closed wzhjlau2009 closed 4 years ago

wzhjlau2009 commented 4 years ago

Hi , Shujun Thanks for so powerful tools, and recently i use LTR_retrieve to identify the LTR-RT from LTR_finder and harvest in Arabidopsis(TAIR10) genome , it works well and i get the results. But only 184 intact LTR_RT are identified in the pass.list file. I think the number is maybe too small for whole Arabidopsis genome. I will very happy if you give me some advice about what led to this results ? Thanks! TAIR10.fa.mod.pass.list.txt

oushujun commented 4 years ago

Hi @wzhjlau2009 ,

184 is a lower count of intact LTR-RTs I've got from the TAIR10 genome, but not surprisingly low. In a recent test (Aug 27, 2019) I got 194 intact LTR-RTs. To obtain raw candidates what parameters did you use?

Best, Shujun

wzhjlau2009 commented 4 years ago

Hi @Shujun Thanks for your reply. You got 194 intact LTR-RTs , did it mean that only 194 intact LTR-RTs in the whole Arabidopsis genome ? Here is my parameters. image

oushujun commented 4 years ago

Yes you are right. Only ~200 intact LTR elements in the Ath genome, which has very little LTRs indeed. Your parameters looks ok. You may get slightly more if lower similarity to 85%.

Best, Shujun

On Thu, Nov 28, 2019, 8:32 AM wzhjlau2009 notifications@github.com wrote:

Hi @shujun https://github.com/shujun Thanks for your reply. You got 194 intact LTR-RTs , did it mean that only 194 intact LTR-RTs in the whole Arabidopsis genome ? Here is my parameters. [image: image] https://user-images.githubusercontent.com/6326115/69814307-e18aff80-122e-11ea-99dc-8559ed0cb6b3.png

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/oushujun/LTR_retriever/issues/60?email_source=notifications&email_token=ABNX4NGHCOXXRK7JSZOLCWLQV7I7TA5CNFSM4JREBFH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEFMYR7Q#issuecomment-559515902, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABNX4NDLDGRWPXQ6IRU7VW3QV7I7TANCNFSM4JREBFHQ .

oushujun commented 4 years ago

Hi @wzhjlau2009,

I recently identified a bug for dropping high-quality intact LTR elements. I fixed it and it could recover 5% more sensitivity for genomes with low LTR contents like Arabidopsis. The version is upgraded to v2.8 now. I identify 214 unique intact LTR elements with this version and 193 for v2.7.

I am drafting a new release with benchmarking results, please check it out when it becomes available.

Best, Shujun

wzhjlau2009 commented 4 years ago

Hi @oushujun , Thanks very much. By the way , How can i identified for all the LTR elements including solo LTR ? By RepeatMasker for denovo genome ? Thanks for your advice!

Best Wishes!

oushujun commented 4 years ago

By default the program will perform whole genome annotation, which is basically RepeatMakser using the generated library with some extra filtering. To find solo LTRs, you may use the soloLTR script in the bin folder.

Best, Shujun

On Thu, Dec 5, 2019, 2:31 AM wzhjlau2009 notifications@github.com wrote:

Hi @oushujun https://github.com/oushujun , Thanks very much. By the way , How can i identified for all the LTR elements including solo LTR ? By RepeatMasker for denovo genome ? Thanks for your advice!

Best Wishes!

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/oushujun/LTR_retriever/issues/60?email_source=notifications&email_token=ABNX4NEDHQWLIG77QFBKW4LQXC365A5CNFSM4JREBFH2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEF75JSQ#issuecomment-562025674, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABNX4NG3TSB2TTTD3BVC6QTQXC365ANCNFSM4JREBFHQ .

wzhjlau2009 commented 4 years ago

Hi, @oushujun I see,recently i want to identify all kinds of LTRs in a denovo assembly genome. Your suggestions will help me a lot. Thank you very much!

Sincerely!

oushujun commented 4 years ago

I consider this issue resolved. Please reopen it if you find out it is not or open new issues.