Closed mbhall88 closed 2 years ago
Here are the results.
sample | guppy v3.4.5 | guppy v3.6.0 | tubby v3.6.0 | guppy v5.0.16 |
---|---|---|---|---|
mada_117 |
R | S | S | S |
mada_118 |
R | S | S | S |
mada_124 |
R | S | S | S |
R and S refer to isoniazid predictions. These samples have a phenotype of S and Illumina genotype of S.
Whilst all of the most recent guppy
/tubby
models get the right call, there is a bit of a difference in the coverage on alleles.
For example, in the mada_124
example indel above, if we look at katG_CC1039C
in the different models we get the following (median) coverage.
Ref | Alt | |
---|---|---|
guppy v3.4.5 | 8 | 19 |
guppy v3.6.0 | 27 | 16 |
tubby v3.6.0 | 41 | 18 |
guppy v5.0.16 | 38 | 31 |
Sadly I do not have a guppy v5.0.16 tubby model yet to compare how that performs.
@iqbal-lab So the question is: which model do we proceed with?
On the one hand, tubby gives slightly better coverage in this example. However using guppy makes the paper methods etc. much simpler....
I am pretty amazed the problem just gets fixed by updating guppy! i would vote for keeping this simple and using guppy in this paper. (Still quite a lot of covg on the Alt in all cases, hard to avoid i guess)
Homopolymer deletions are causing (
mykrobe
) FP resistance calls in katG (isoniazid).An example from a
mykrobe
report (mada_124
)The corresponding Illumina report has no support for these indels.
Currently, the underlying Nanopore data is basecalled with
guppy
v3.4.5. So, the first step is to test whether newer versions ofguppy
andtubby
remove these errors or not.To test this, there are six samples with these indel issues in katG
mada_124
mada_129
mada_118
mada_140
mada_148
mada_117
Rather than re-basecall all of these samples, we will first test different versions on three samples from the same Nanopore run to reduce the amount of basecalling we need to do:
mada_117
,mada_118
, andmada_124
.Checklist
Basecall and run
mykrobe
with model versions:guppy
v3.6.0guppy
v5.0.16tubby
v3.6.0