niu-lab / msisensor2

Microsatellite instability (MSI) detection for tumor only data.
GNU General Public License v3.0
94 stars 21 forks source link

Hello, I would like to ask a simple question. #15

Open jhs1989 opened 4 years ago

jhs1989 commented 4 years ago

Hello, I would like to ask a simple question.

It is about "output.prefix_dis" among output files in msisensor2.

The following is kindly introduced in README.md.

output.prefix_dis: read count distribution (T: tumor) chr22 29286892 AAAGC 12[T] CTCTT T: 0 0 0 0 0 0 0 0 25 71 4 86 3 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

My question is what tool used to count "read count" and what options to count read count on.

Also, I am wondering why the digit is fixed at 100.

and, The very first 0 means the read count when the repeat length's readcount is 0.

is it correct?

thank you

owehann commented 4 years ago

Hi, thanks for you attention on MSIsensor2. Samtools was used to find each site and then its distribution would be counted. The total length of microsatellites generally does not exceed 50bp, so the fixed length of 100bp is inclusive enough. The very first 0 means the read count when the repeat length's readcount is 1. Thanks