rki-mf1 / covsonar

A database-driven system for handling genomic sequences of SARS-CoV-2 and screening genomic profiles.
GNU General Public License v3.0
6 stars 0 forks source link

Stats about blind spots (N) when matching genomic profiles #91

Open matthuska opened 1 year ago

matthuska commented 1 year ago

It would be nice to be able to differentiate between three states for each position in the genome:

  1. matches the reference
  2. does not match the reference and is some other nucleotide or a gap
  3. and unknown (=N)

That third case isn't handled well by covsonar, so it would be nice to have some idea when you query for a position, how many sequences have an unknown (=N) at that position. Maybe this information would be included using an optional argument.