ISUgenomics / SequelTools

new repo
GNU General Public License v3.0
26 stars 6 forks source link

Definition of numReadsLongestSub #11

Open sum732 opened 3 years ago

sum732 commented 3 years ago

Hello Developers,

Thanks for creating the SequelTools.

Just a basic question, how is numReadsLongestSub defined. I tried to look in to the scripts to identity the cuotff, but could not find it out. Are these reads above a certain length, if so how is this decided? If so such hard limits then how is a LongestSub defined?

Thanks, Deep

DavidEHufnagel commented 3 years ago

Hello,

Thank you for your interest in SequelTools and my apologies for the late reply.

numReadsLongestSub is simply the number of longest subreads for a given SMRTcell, where longest subreads are defined as the longest subread in a CLR. So for any template the polymerase will attempt to read it and hopefully will read the subread multiple times. When that works out this produces multiple subreads for one CLR. SequelTools identifies one of those subreads as the longest one. In the case where something like 1.5 or 2.5 subreads are read, which is fairly common, it is very useful to know which is longer and use that one. In some cases multiple subreads have the same exact length in which case SequelTools will choose one randomly to be the longest subread.

Best, David E. Hufnagel

On Wed, Feb 17, 2021 at 12:59 PM sum732 notifications@github.com wrote:

Hello Developers,

Thanks for creating the SequelTools.

Just a basic question, how is numReadsLongestSub defined. I tried to look in to the scripts to identity the cuotff, but could not find it out. Are these reads above a certain length, if so how is this decided? If so such hard limits then how is a LongestSub defined?

Thanks, Deep

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/ISUgenomics/SequelTools/issues/11, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABQPE3MFZVRHPM7QAJU32JDS7QGZ5ANCNFSM4XY43F4Q .