To check whether a file is paired-end, only the first 1000 reads are used. This was originally done to significantly improve runtime. However, when the file contains fewer than 1000 reads, as in some very small test data, truly paired-end reads are detected as single-end reads.
Solution idea:
Check the total number of reads. If it is above 1000, proceed as now; otherwise use the total number of reads.
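A minimal sketch of this idea in Python, not the tool's actual code. The function name `is_paired_end`, the `sample_size` parameter, and the input (one boolean per read, True if the read is flagged as paired) are all hypothetical, as is the decision rule that every inspected read must be paired. The point is only that the sample is capped at the number of reads actually present, so a file with fewer than 1000 reads is judged on all of its reads.

```python
from itertools import islice


def is_paired_end(read_flags, sample_size=1000):
    """Guess whether a file is paired-end from at most `sample_size` reads.

    `read_flags` is a hypothetical input: an iterable yielding True for
    each read flagged as paired and False otherwise.
    """
    # islice stops early when the input runs out, so small files are
    # judged on however many reads they actually contain.
    sample = list(islice(read_flags, sample_size))
    if not sample:
        # Assumption: an empty file is reported as not paired-end.
        return False
    # Assumed rule: paired-end only if every inspected read is paired.
    return all(sample)
```

Because the sample is taken lazily, this avoids a separate pass to count the reads, so the runtime benefit of inspecting only the first 1000 reads is kept for large files.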