arq5x / poretools

a toolkit for working with Oxford nanopore data
MIT License
239 stars 90 forks source link

quality scores in FASTQ vs FAST5 #146

Open aarthi-mohan opened 7 years ago

aarthi-mohan commented 7 years ago

Hi,

I am new to MinION sequencing analysis and using albacore to basecall 1D reads. I used porteools to extract fastq records and generated FastQC reports. For many sequences the "Per base sequence quality" from FastQC is "20" and the _"meanqscore" attribute in the basecalled group of the FAST5 file is "13".

In Figure 1 from http://www.biorxiv.org/content/biorxiv/early/2015/05/13/019281.full.pdf paper, the quality score also range between 0-13. My question is are these Phred33 scores and if not how do they differ? Albacore places the basecalled sequences in group "001". I found that the command stats_ reports on group 000 when "--full-tsv" option is used. poretools stats --full-tsv --group 1 -> reports on group 000

Appreciate your help and guidance.

Thanks, Aarthi