fiberseq / fibertools-rs

Tools for fiberseq data written in rust.
https://fiberseq.github.io/fibertools/fibertools.html
42 stars 5 forks source link

False methylations called in first 7 or so bp #65

Closed mtcicero26 closed 1 week ago

mtcicero26 commented 1 week ago

Example output (bed) fiber_sequence AAACAAATGCTGAGGCCATTCTCCTGTTCTGCTTCCTGGCACTACG... m6a 0,1,2,4,5,6,21,318,1170,2616,2884,4133,4567,47... ref_m6a -1,-1,-1,-1,-1,-1,-1,33455045,33455907,3345735..

Example command ft predict-m6a -t 8 -k IN.bam OUT.bam

mtcicero26 commented 1 week ago

Also, it should be noted that the -1s in the example ref_m6a output do not consistently show up-- if that part of the read aligned, they will show correct reference coordinates for the false methylations.

mrvollger commented 1 week ago

This will be fixed in the next release and can be tested in this PR #64. This issue seems to be limited to the first and last 7bp in Revio data, and would not have impacted MSP or NUC calling in fibertools since we don't start calling until the 45th bp.

mtcicero26 commented 1 week ago

As a note, I've already accounted for this with FiberHMM, so it should have no effect on footprint calling either.

mrvollger commented 1 week ago

Great news! I am reopening until I merge the fix.