For Illumina data it has been reasonable to demand that the full allele (all kmers covering the mutation) be detected in reads. For nanopore data, with such a high error rate, this is not appropriate, and we shoud remove this constraint and allow our models to choose for us.
We see examples where sample is S, and we have a lot of covg on the S allele, but not quite 100%
eg the following, where the models would clearly do the right thing if we were not forcing the issue with this threshold that forcibly sets the log likelihood to -9999999.
For Illumina data it has been reasonable to demand that the full allele (all kmers covering the mutation) be detected in reads. For nanopore data, with such a high error rate, this is not appropriate, and we shoud remove this constraint and allow our models to choose for us.
We see examples where sample is S, and we have a lot of covg on the S allele, but not quite 100% eg the following, where the models would clearly do the right thing if we were not forcing the issue with this threshold that forcibly sets the log likelihood to -9999999.