Closed MathieuCharles closed 7 years ago
Thanks @MathieuCharles for noting this issue.
It looks like i
is a valid character in the phred64 (old Illumina) scale. The issue instead is :
in the barcode qual scores, which is only valid in phred33.
However, we can ignore the barcode qual scores, since these are artificial. The sequencing run used an old Illumina format where barcodes are provided in the header line without phred scores, in lieu of a separate barcode fastq. Hence, artificial barcode fastq files were generated to support processing in qiime, but these were mistakenly made with phred33 artificial scores.
I am updating the mock-7
and mock-8
raw barcode files, but in the meantime you can fix these files yourself with the following command:
gunzip -c barcodes.fastq.gz | sed 's/::::::::::::/KKKKKKKKKKKK/g' | gzip -c > barcodes2.fastq.gz
The fixed files should work in the example commands that you provided.
A final question, does the sequence still contains primer sequences ?
No, the primer sequences do not appear to be present in the sequence reads.
As reported in #57 , I encountered some trouble using mock 7 and 8.
I am using qiime 19.1
It seems that the phred quality score is not valid.
The indicated sequence contain "i" quality character, so corresponding to ascii 105, which is out of the classical score scale.
for mock 8
I checked the sequence but the ascii value are between 66 and 104, so it should work.
Do you have same trouble ? What can I do to solve this ?
A final question, does the sequence still contains primer sequences ?