Thank you for reporting the problem. Due to a bug, most of the training data (quality scores) for accuracy=97 was replaced with nucleotide sequences. It will take about a week to correct the model. In the meantime, I have removed QSHMM-ONT-HQ.model and ERRHMM-ONT-HQ.model.
We have confirmed that there are no problems with models other than QSHMM-ONT-HQ.model.
Thank you for the investigation. For my own evaluation purposes, I've replaced all the values in the 97 EP lines with the ones from the 96 EP lines, and the generated sequences look fine.
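A workaround of this kind could be sketched roughly as follows. This is only an illustration under assumptions, not the script actually used: it assumes each model line is whitespace-separated, that the first two fields are the accuracy and a tag such as EP, and that the k-th 97 EP line corresponds to the k-th 96 EP line. The function name `copy_ep_lines` is hypothetical.

```python
def copy_ep_lines(model_lines, src="96", dst="97"):
    """Return the model lines with every `dst EP` line taking the values
    of the matching `src EP` line (matched by order of appearance).

    Assumption: fields are whitespace-separated and start with
    "<accuracy> EP"; the real model layout may differ.
    """
    # Collect the value fields of all source EP lines, in file order.
    src_values = [ln.split()[2:] for ln in model_lines
                  if ln.split()[:2] == [src, "EP"]]
    out, k = [], 0
    for ln in model_lines:
        fields = ln.split()
        if fields[:2] == [dst, "EP"]:
            if k >= len(src_values):
                raise ValueError("more destination EP lines than source EP lines")
            # Keep the destination prefix, swap in the source values.
            out.append(" ".join(fields[:2] + src_values[k]))
            k += 1
        else:
            out.append(ln)  # every other line is kept unchanged
    if k != len(src_values):
        raise ValueError("source and destination EP line counts differ")
    return out
```

Rewritten lines are joined with single spaces, so the output would need adjusting if the model file relies on a different delimiter.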
Please use v3.0.4.
Thanks, I'll try it.
The generated quality strings look good. Thank you for fixing it!
When I used the latest pbsim3 (3.0.2) with the --qshmm data/QSHMM-ONT-HQ.model option, I found that some of the generated reads have strange quality strings, where both the sequence and the quality lines look like nucleotide sequences.

I investigated the code and the model file, and found that the columns corresponding to A, C, G, and T in the 97 EP lines of QSHMM-ONT-HQ.model have much higher values than the other columns of the 97 EP lines. Is this due to a bug in the training script?
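For anyone wanting to check their own output for the same symptom, a minimal sketch is shown below. It assumes plain 4-line FASTQ records (no wrapped sequences), and the function name `suspicious_records` and the 0.9 threshold are arbitrary choices for illustration. In Phred+33 encoding, A, C, G, and T are valid quality characters (Q32, Q34, Q38, and Q51), so a few of them in a quality string is normal, but a string made up almost entirely of these four characters is a strong hint of the problem described above.

```python
def suspicious_records(fastq_lines, threshold=0.9):
    """Yield (read_name, fraction) for records whose quality string
    consists mostly of the characters A, C, G, and T.

    Assumption: the input is plain 4-line FASTQ (header, sequence,
    '+', quality), one record after another.
    """
    lines = [ln.rstrip("\n") for ln in fastq_lines]
    for i in range(0, len(lines) - 3, 4):
        name, qual = lines[i], lines[i + 3]
        if not qual:
            continue  # skip empty quality strings
        # Fraction of quality characters that look like nucleotides.
        frac = sum(ch in "ACGT" for ch in qual) / len(qual)
        if frac >= threshold:
            yield name[1:], frac  # drop the leading '@'


if __name__ == "__main__":
    example = ["@r1", "ACGT", "+", "IIII",
               "@r2", "ACGT", "+", "GATC"]
    for read, frac in suspicious_records(example):
        print(read, frac)
```

In the small example, only the second record is reported, because its quality line is made up entirely of nucleotide letters.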