fedarko / strainFlye

Pipeline for analyzing (rare) mutations in metagenome-assembled genomes
BSD 3-Clause "New" or "Revised" License

Output diversity index numerator and denominator as separate TSV entries, rather than as floats? #18

Open fedarko opened 2 years ago

fedarko commented 2 years ago

It shouldn't make a difference, but this way we'd avoid having to output string representations of floats, since the numerator (# mutations in sufficiently-covered positions) and denominator (# sufficiently-covered positions) must both be integers. That said, in order to compute the indices the reader of this file will eventually have to divide these quantities anyway, so there is a tradeoff between ease of use (no need for the user to do the division) and pathological avoidance of floating-point stuff.

... Probably safe to keep this as is, but may be worth thinking about.
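
For illustration, here is a rough sketch of what the "separate entries" option could look like. It is not strainFlye's actual output code: the column names (`Contig`, `NumMutations`, `NumCoveredPositions`), the example counts, and both helper functions are hypothetical. The idea is that the file stores only integers, and the reader does the division (exactly, via `fractions.Fraction`) when it needs the index.

```python
import csv
import io
from fractions import Fraction

# Hypothetical per-contig counts:
# (contig name, # mutations in sufficiently-covered positions,
#  # sufficiently-covered positions)
counts = [("contig_1", 3, 1000), ("contig_2", 1, 3)]


def write_counts(rows, handle):
    """Write the numerator and denominator as separate integer TSV columns."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    # Column names are assumptions, not strainFlye's real header.
    writer.writerow(["Contig", "NumMutations", "NumCoveredPositions"])
    writer.writerows(rows)


def read_indices(handle):
    """Read the TSV back and compute each index as an exact fraction."""
    indices = {}
    for row in csv.DictReader(handle, delimiter="\t"):
        num = int(row["NumMutations"])
        den = int(row["NumCoveredPositions"])
        # An index is undefined if no positions are sufficiently covered.
        indices[row["Contig"]] = Fraction(num, den) if den > 0 else None
    return indices


# Round-trip through an in-memory file to show that no floats are written.
buf = io.StringIO()
write_counts(counts, buf)
buf.seek(0)
indices = read_indices(buf)
```

The cost is the one noted above: anyone who just wants the index has to do the division themselves (e.g. `float(indices["contig_2"])`), whereas a single float column can be used directly.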