Zhenlisme / heliano

HELIANO is a software for detection and classification of Helitron-like elements.
https://github.com/Zhenlisme/HELIANO/
GNU General Public License v3.0
21 stars 1 forks source link

About noauto helitron detection #6

Open lyysxb opened 1 week ago

lyysxb commented 1 week ago

Hi,

Thank you very much for your open source contribution. I would like to ask how the 'noauto' Helitron is detected in this code, and specifically, in which method is it implemented?

Zhenlisme commented 1 week ago

Hi,

Thank you for using HELIANO.

I am not sure if I understand correctly to your question. To detect nonautonomous Helitron insertions, HELIANO firstly identify the significantly co-occuring LTS-RTS pairs in a given genome. After obtaining the LTS-RTS pairs, HELIANO will recover the nonauto coordination based on the coord of LTS and RTS.

All the best, Zhen

halonlinctos commented 2 days ago

I used HELIANO to test the genomes of peanut, corn and strain Fo5176 mentioned in the example, using the parameters provided in usage heliano -g test.fa -is1 0 -is2 0-o test_opt -w 15000 I have the following questions and would like to ask you:

  1. Among the three genomes, only Fo5176 contains HLE1 and HLE2 in the results, and only HLE1 is found in peanut and corn. As corn is one of the plants with the richest transposon content, why is HLE2 not detected?
  2. I have used helitronscanner to detect several maize assembled genomes. Taking maize Mo17 as an example, 35,819 helitron transposons were detected. However, using your HELIANO, only 430 were detected in the RC.representative.bed file, and 703 were detected in peanuts. These numbers are far lower than helitronscanner. Although the false positive rate of helitronscanner is relatively high, I think this gap is unlikely to be due to this reason. Is there something wrong with my parameter settings?
  3. After obtaining the RC.representative.bed file, is this the final transposon identified? Is there any further operation required?
  4. Frequently asked questions,you say strigent parameter '-pt 1 -is1 1 -is2 1 -p 1e-5 -s 30 -pt 1 -sim_tir 100' ,why are there double ‘-pt’ ,is it a 'Typographical error'?or anything else Looking forward to your response,thanks!
Zhenlisme commented 1 day ago

Best, Zhen

halonlinctos commented 1 day ago

Thank you very much for your reply! I have discussed this with my senior and teacher, and their opinions are exactly the same as your reply. Accurately identifying Helitron transposons is one of the necessary parts of my project. Your tool is very valuable to me. In order to better play its role, if it is convenient for you, can you upload the development version of HELIANO or send it to the mailbox 65520407@qq.com? We want to train the HMM model and try to find some Helitron with less degradation. Thanks!

Zhenlisme commented 23 hours ago

You said you want to find Helitron with less degradation. Do you mean young Helitrons with complete structures? If that is your question, then I don't understand why you need a HMM model to do that job. Anyway, the HMM model (RepHel.hmm) for HELIANO is avaliable within its package. You can also download it from this repository. This model is only used for scanning and classifying Helitron-like transposases.