phac-nml / biohansel

Rapidly subtype microbial genomes using single-nucleotide variant (SNV) subtyping schemes
Apache License 2.0

Quality assessment module #14

Closed Takadonet closed 6 years ago

Takadonet commented 7 years ago

Create a new module that evaluates the results hansel has generated for the following two use cases:

  1. Mixed subtypes (inconsistent subtyping results)
  2. Insufficient number of SNV targets found

A mixed subtype would be flagged if more targets are found than the total expected for the scheme, i.e. both the positive and the negative tile for a given site are found. As a starting threshold, try flagging samples with 3 or more excess tiles as suspicious.
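
A rough sketch of how the mixed-subtype check could look. The function name `find_mixed_sites`, the `(site, is_positive)` input shape, and the `max_excess` parameter are all hypothetical and not part of hansel; it also assumes each site has exactly one positive and one negative tile, so each conflicting site counts as one excess target.

```python
from collections import defaultdict


def find_mixed_sites(tile_hits, max_excess=3):
    """Return (conflicting_sites, is_flagged) for a set of tile hits.

    tile_hits: iterable of (site, is_positive) pairs, one per tile found.
    This input shape is an assumption, not hansel's actual output format.
    """
    kinds_by_site = defaultdict(set)
    for site, is_positive in tile_hits:
        kinds_by_site[site].add(bool(is_positive))

    # A site is conflicting when both its positive and negative tiles were found.
    conflicting = sorted(s for s, kinds in kinds_by_site.items() if len(kinds) == 2)

    # Flag the sample once the number of excess targets reaches the threshold.
    return conflicting, len(conflicting) >= max_excess
```

For example, hits at sites 100, 300 and 400 for both tile types would give three conflicting sites and flag the sample at the suggested threshold of 3.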

An insufficient number of SNV targets should be flagged if more than 5% of the scheme's targets are missing. For example, the SH scheme has 188 tiles, so a sample would be flagged if fewer than roughly 180 tiles are found.
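
A minimal sketch of the missing-targets check, assuming the module is given the number of distinct targets found and the number expected by the scheme. The name `has_insufficient_targets` and its parameters are hypothetical, not existing hansel code.

```python
def has_insufficient_targets(n_found, n_expected, max_missing_fraction=0.05):
    """Return True when more than max_missing_fraction of targets are missing.

    n_found: number of distinct scheme targets detected in the sample.
    n_expected: total number of targets defined by the scheme.
    """
    if n_expected <= 0:
        raise ValueError("n_expected must be a positive integer")

    # Excess targets (n_found > n_expected) are a mixed-subtype concern,
    # so they are treated as zero missing here.
    n_missing = max(0, n_expected - n_found)
    return n_missing / n_expected > max_missing_fraction
```

With a strict 5% cutoff on a 188-tile scheme, 5% is 9.4 tiles, so this sketch flags a sample once 10 or more tiles are missing (178 or fewer found). The exact cutoff is a design choice that could be exposed as a parameter.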