xiezhq / ISEScan

A python pipeline to identify IS (Insertion Sequence) elements in genome and metagenome
Apache License 2.0
79 stars 17 forks source link

Inaccurate Boundary Determination of IS Elements #57

Open 2021JohnSheng opened 2 months ago

2021JohnSheng commented 2 months ago

Hi, I would like to commend you on creating an excellent tool. It has been invaluable in my research. However, I have encountered a specific issue that I believe warrants your attention.

During my use of the tool, I have observed numerous instances of nested IS elements. For example, an identified IS element (denoted as A) is often found within another identified IS element (denoted as B). In other words, B encloses A.

This nested structure raises a critical question: which coordinates (start and end) should be considered definitive for the true IS element? It is apparent that both A and B contain a genuine IS element, but the overlap creates ambiguity regarding the accurate boundaries.

Could you please provide clarification on the methodology used by the tool to determine the boundaries of the true IS element in such nested scenarios? Any insights or recommended approaches to resolve this issue would be greatly appreciated.

Thank you for your attention to this matter. I look forward to your guidance.

Best regards, Yong Sheng