oushujun / LTR_retriever

LTR_retriever is a highly accurate and sensitive program for identification of LTR retrotransposons; The LTR Assembly Index (LAI) is also included in this package.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813529/
GNU General Public License v3.0
176 stars 40 forks source link

LTR insertion time #82

Closed liulu0827 closed 3 years ago

liulu0827 commented 3 years ago

Hi, shujun,

I am trying using ltr_retriever to calculate LTR insertion time. Here is the command I used: LTR_retriever -genome $ASSEMBLY -inharvest $ASSEMBLY.harvest.scn -infinder $ASSEMBLY.finder.scn -threads 40

However, according to the result of mod.pass.list, the burst of LTR occurred at ~0.2 Ma, it's weird (Fig. 1). As I know, the common burst of LTR is at 1-5 Ma. Especially, a close relative of my species is ~1.8 Ma. Is there any important parameters I should pay attention to? or do I misunderstand this result? I am really confused with this result.

image

Fig. 1

By the way, I also used EDTA. The result is similar in the file .mod.EDTA.raw/LTR/.mod.pass.list. Here is the command I used: ./EDTA.pl -genome $ASSEMBLY -species others -step all -evaluate 1 -sensitive 1 -anno 1

Best,

Lu Liu.

oushujun commented 3 years ago

Hi Lu,

First thing you may need to set the mutation rate of your species with -miu or at least using the same mutation rate for the two analyses to be comparable.

If you have done this, then pay attention to what LTR elements are used in the two analyses. In LTR_retriever, only intact LTR elements are used. If you include more fragmented or much-mutated elements, the peak may shift.

Then, please double check if the two analyses were based on the same genome. If not, please check the quality of both genomes because the LTR assembly is usually depleted in poorly assembled genomes. You may check out the LAI metric which was developed to evaluate the assembly quality of the TE space. Below is an example of how LTR age distribution could be shifted due to genome quality. Similarily, error polishing will affect the age distribution because sequencing errors are counted as mutations for age calculations.

image

Lastly, based on my experience, an intact LTR age peak on 0.2 MY is VERY common to a degree that if you don't see this peak, you should question the quality of your genome. Based on your description, your genome is not weird at all.

Best, Shujun

liulu0827 commented 3 years ago

Hi Shujun,

Thanks for your reply.

I checked all you mentioned, the mutation rate, intact LTR and the quality of genomes. The LAI score of my genome assembly is 18. The others you mentioned have no problems except of the LTR insertion of relative species. The LTR insertion of relative actually occured from 1.8 Mya, not has a peak on 1.8 Mya. And I analysed the genome of the relative, the result was similar with my species genome.

So I think it's a normal result of 0.2 Mya, just I don't see this peak before.

Thanks a lot, Lu Liu

amvarani commented 2 years ago

Hi there, Sorry, I know that this is a closed topic, but I would like to ask which script you used to make these plots of the LTR insertion time ?

oushujun commented 2 years ago

Hi @amvarani,

You may try the ggplot2 package in R. I usually make geom_histogram or geom_density plots for LTR insertion time.

Best, Shujun

Jiangjiangzhang6 commented 1 year ago

hello, shujun, I draw the plot like this image my LAI was 24, gap-free , so its normal?

oushujun commented 1 year ago

@Jiangjiangzhang6, yup, looks like a typical plant genome in high quality and with abundant LTR activities in the scale of evolution time.

Jiangjiangzhang6 commented 3 months ago

Lastly, based on my experience, an intact LTR age peak on 0.2 MY is VERY common to a degree that if you don't see this peak, you should question the quality of your genome. Based on your description, your genome is not weird at all

sorry for that, its a late answer, I saw the front the relationshio of LAI and the ltr insertion time, the LAI higher and the peak of insertion time near zero. I just has the puzzled about that, if the situation was that, the ltr insertion time estimated was just with no meanning looking forward to your reply