Instead of relying only on cutoff frequencies such as varThresh, quality scores could be incorporated into the collapsing step so that actual probabilities decide what the base in question is. Each FASTQ quality character encodes a Phred score Q, and Q maps to the probability that the base call is wrong via P = 10^(-Q/10), so the probability of a correct call is 1 - P. Decoding the quality string this way might make probability-based collapsing possible.
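A rough sketch of how this might look in Python. Everything here is an assumption for illustration, not the existing collapsing code: the function names are hypothetical, the quality strings are assumed to use the Phred+33 (Sanger / Illumina 1.8+) offset, reads are treated as independent, a miscall is assumed equally likely to be any of the other three bases, and the prior over A/C/G/T is flat.

```python
import math

PHRED_OFFSET = 33  # assumption: Sanger / Illumina 1.8+ encoding (Phred+33)


def phred_to_error_prob(q):
    """Phred score Q -> probability that the base call is wrong."""
    return 10 ** (-q / 10)


def quality_char_to_error_prob(ch, offset=PHRED_OFFSET):
    """Single FASTQ quality character -> error probability."""
    return phred_to_error_prob(ord(ch) - offset)


def call_base(bases, quals, offset=PHRED_OFFSET):
    """Pick the most probable base for one column of a read family.

    bases: the base observed in each read at this position, e.g. "AAG"
    quals: the matching FASTQ quality characters, e.g. "II+"
    Returns (best_base, posterior_probability).
    """
    log_like = {}
    for candidate in "ACGT":
        total = 0.0
        for base, qual_char in zip(bases, quals):
            p_err = quality_char_to_error_prob(qual_char, offset)
            # Clamp so that Q0 (p_err = 1) never produces log(0).
            p_err = min(max(p_err, 1e-10), 1 - 1e-10)
            if base == candidate:
                total += math.log(1 - p_err)
            else:
                # Assumption: an error is equally likely to be any other base.
                total += math.log(p_err / 3)
        log_like[candidate] = total

    # Normalise under a flat prior; subtract the max for numerical stability.
    top = max(log_like.values())
    weights = {b: math.exp(v - top) for b, v in log_like.items()}
    norm = sum(weights.values())
    best = max(weights, key=weights.get)
    return best, weights[best] / norm
```

With this scheme, call_base("GGA", "##I") picks A: the two G calls are Q2 and the single A call is Q40, so the high-quality read outweighs the low-quality majority. A pure frequency cutoff cannot make that distinction.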
https://github.com/GabePires/FastQ-Converter
http://biopython.org/DIST/docs/api/Bio.SeqIO.QualityIO-module.html
https://en.wikipedia.org/wiki/Phred_quality_score
https://en.wikipedia.org/wiki/FASTQ_format