ZhouQiangwei / dmtools

BS-seq, WGBS, NOMe-Seq, RRBS data storage and analysis tool
https://dmtools-docs.rtfd.io
MIT License
5 stars 0 forks source link

Question about storage format #2

Closed kdkorthauer closed 8 months ago

kdkorthauer commented 9 months ago

Just a quick question about the storage format. Why not include integer counts of methylated reads instead of a float methylation level? The float value storage takes up more file space and could introduce ambiguities due to rounding that would prevent modeling the WGBS data using count models. Thanks!

ZhouQiangwei commented 8 months ago

Hi, Because we consider that checking the DNA methylation level at specific sites does not require additional computation, especially during visualization. I understand your point, and we will consider adding a storage result that retains only the number of reads.