Add better fastq handling

What does it do?

Re-implements FASTQ extraction to remove problematic reads

Related issues

Lately, I have been having a lot of issues with the following:

Non-ASCII encoded characters
ASCII control characters
Un-equal seq and quality strings.

This implementation now scans the entire FASTQ (SE and PE) and removes problems reads. If the headers don't match up in PE data then it is automatically switched to SE. I hope this will solve some of the instability I see in the pipeline which typically traces back to one of these problems.

jfear / ncbi_remap

Add better fastq handling #117

What does it do?

Related issues