Currently the program searches for the exact string DS:Z: in the whole line from stdin to restore the information using added optional fields from previous runs of the program. This creates problems when the pattern is present in fields other than optional fields. We encountered this when trying to process a bam file that had one single such problematic read in the whole ~60GB bam. To reproduce the issue, the read is pasted below:
Traceback (most recent call last):
File "pmdtools.0.60.py", line 349, in <module>
PMDS= float(line.split('DS:Z:')[1].rstrip('\n').split()[0])
ValueError: could not convert string to float: Q5YKUPT[]]]LOP[M
when the program is run only with the --deamination option.
Currently the program searches for the exact string
DS:Z:
in the whole line from stdin to restore the information using added optional fields from previous runs of the program. This creates problems when the pattern is present in fields other than optional fields. We encountered this when trying to process a bam file that had one single such problematic read in the whole ~60GB bam. To reproduce the issue, the read is pasted below:outputs:
when the program is run only with the
--deamination
option.