naobservatory / p2ra


pseudocounts: determine fraction of samples this applies to #229

jeffkaufman closed this issue 1 year ago

jeffkaufman commented 1 year ago

For flu we use pseudocounts to handle cases where we'd otherwise have zero values. Make a script that tells us how often this happens.

This also involves teaching Variable how to represent whether it's a pseudocount.

$ ./determine_pseudocounts.py
Influenza A crits_christoph 7 7 100%
Influenza A rothman 36 104 35%
Influenza A spurbeck 36 159 23%
Influenza A brinch 36 159 23%
Influenza B crits_christoph 7 7 100%
Influenza B rothman 57 104 55%
Influenza B spurbeck 57 159 36%
Influenza B brinch 57 159 36%
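
A minimal sketch of how such a tally could work, not the actual `determine_pseudocounts.py`. It assumes the three numeric columns above are the number of samples that needed a pseudocount, the total number of samples, and their ratio as a percentage. The `Variable` class here is a hypothetical stand-in with an `is_pseudocount` flag; the real class in p2ra may represent this differently, and `pseudocount_summary` and `format_row` are names invented for illustration.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Variable:
    # Hypothetical stand-in for the project's Variable class; it only
    # carries the fields this illustration needs.
    pathogen: str
    study: str
    value: float
    is_pseudocount: bool = False  # True when value replaced a zero


def pseudocount_summary(variables):
    """Return {(pathogen, study): (n_pseudocount, n_total)}."""
    totals = Counter()
    pseudo = Counter()
    for v in variables:
        key = (v.pathogen, v.study)
        totals[key] += 1
        if v.is_pseudocount:
            pseudo[key] += 1
    # Counter returns 0 for keys that never saw a pseudocount.
    return {key: (pseudo[key], totals[key]) for key in totals}


def format_row(pathogen, study, n_pseudo, n_total):
    """Format one line as: pathogen study n_pseudo n_total percent."""
    return f"{pathogen} {study} {n_pseudo} {n_total} {n_pseudo / n_total:.0%}"


if __name__ == "__main__":
    # Toy inputs for illustration only; not data from any study.
    toy = [
        Variable("Pathogen X", "study_1", 0.1, is_pseudocount=True),
        Variable("Pathogen X", "study_1", 3.0),
        Variable("Pathogen X", "study_2", 5.0),
    ]
    for (pathogen, study), (n_pseudo, n_total) in pseudocount_summary(toy).items():
        print(format_row(pathogen, study, n_pseudo, n_total))
```

Keeping the flag on the variable itself means the tally only has to read it back, rather than re-deriving which inputs were originally zero.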