oushujun / LTR_retriever

LTR_retriever is a highly accurate and sensitive program for identifying LTR retrotransposons. The LTR Assembly Index (LAI) is also included in this package.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5813529/
GNU General Public License v3.0

Can LAI be a good metric for contiguous assemblies with a relatively high consensus error rate? #72

Closed chocotwig closed 4 years ago

chocotwig commented 4 years ago

Hi, I have an assembly that is relatively contiguous but was generated from error-prone long-read sequencing. The assembly is unpolished and has a consensus base-level error rate of about 5-7%. In your view, could LAI still be a good metric for assessing this assembly?

Thanks!

oushujun commented 4 years ago

No. LAI relies on accurate sequences because it needs to estimate the divergence of LTR sequences to account for LTR activity. If there are base-pair differences between two LTR sequences, LAI cannot distinguish whether a difference is due to mutation or to sequencing error. Sequencing errors inflate the apparent divergence, so the program treats the LTRs as old and overcorrects, which gives LAI values that are too low.

Shujun
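
To give a rough sense of scale for the point above, here is a minimal sketch of how much divergence sequencing errors alone could add between two LTR copies that are in reality identical. It is not LAI's actual calculation. It assumes that errors are independent substitutions (long-read errors are often indels, so this is a simplification), that divergence is corrected with the Jukes-Cantor model, and that age is taken as T = K / (2 * mu) with an assumed rate of 1.3e-8 substitutions per site per year. The function names are made up for illustration.

```python
import math


def apparent_divergence(error_rate: float) -> float:
    """Expected mismatch fraction between two truly identical LTR copies
    when each base is independently miscalled with probability error_rate."""
    # A site still matches if both copies are read correctly, or if both are
    # miscalled to the same one of the three wrong bases.
    match = (1 - error_rate) ** 2 + error_rate ** 2 / 3
    return 1 - match


def jukes_cantor(p: float) -> float:
    """Jukes-Cantor corrected distance for an observed mismatch fraction p."""
    return -0.75 * math.log(1 - 4 * p / 3)


def apparent_age_years(error_rate: float, mu: float = 1.3e-8) -> float:
    """Apparent insertion age T = K / (2 * mu) caused by errors alone.
    mu is an assumed substitution rate per site per year."""
    k = jukes_cantor(apparent_divergence(error_rate))
    return k / (2 * mu)


# Compare a polished assembly with the 5-7% error range from the question.
for e in (0.001, 0.05, 0.07):
    p = apparent_divergence(e)
    age_my = apparent_age_years(e) / 1e6
    print(f"error rate {e:.3f}: apparent divergence {p:.4f}, "
          f"apparent age {age_my:.2f} million years")
```

Under these assumptions, a 5% per-base error rate makes a freshly inserted element look roughly as diverged as one that is millions of years old, which is the kind of signal that cannot be separated from real mutation.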