arq5x / poretools

a toolkit for working with Oxford nanopore data
MIT License
239 stars 90 forks source link

Read lengths of 0 #159

Open rwinand opened 6 years ago

rwinand commented 6 years ago

Hi,

I am analyzing data from a 16S amplicon experiment and I found a difference in number of reads that are in the Albacore fastq files compared to the Poretools fastq files (and other Poretools statistics).

For instance, I have 1282 fast5 files and 1282 reads in the Albacore fastq files but there are only 1182 reads reported by Poretools and saved in the fastq file. After doing some checking, I found that these 100 'missing' reads are reported to have length 0 (by Poretools) although they have varying non-zero lenghts in the Albacore fastq files. I have tried to find anything in common between these reads but have not yet identified anything that would explain this.

Do you have any idea why this is happening?