PacificBiosciences / pb-CpG-tools

Collection of tools for the analysis of CpG data
BSD 3-Clause Clear License
74 stars 6 forks source link

What is the diffrence between Jasmine and pb-CpG tools? #72

Closed aindap closed 1 month ago

aindap commented 1 month ago

Hi PacBio - I'm still new to HiFi secondary analysis. I'm a little confused between the output of Jasmine and what pb-CpG-tools is doing. I had a pbmm2 derived BAM I pulled from here and the BAM file had the MM and ML tags required for pb-CpG tools. Is Jasmine doing the same thing? What exactly is the difference in the results they are returning?

ctsa commented 1 month ago

pb-CpG-tools is used to summarize methylation at the site level.

indapa commented 1 month ago

Thanks for the explanation @ctsa. I did notice the the HiFi WGS workflow calls pb-CpG-tools to generate the bigWig/BED output with the site level methylation probabilities, which is what I need for my downstream analysis.

Does pbmm2 add the MM/ML tags as well when given an unmapped BAM file with kinetics information? I didn't see Jasmine being used in the WGS WDL workflow.

ctsa commented 1 month ago

The most common scenario is for the methylation calling to have already been run upstream as part of SMRT Link. Jasmine can only be used if you requested BAM output with kinetics tags.