About 10% of my paired-end sequences are being reported twice in the output files from Kraken2. The headers are not long enough to be truncated, and the sequence headers are not duplicated in the input fastq.gz files. Sometimes, but not always, the reads are classified differently. For example, this is in the output file:
About 10% of my paired-end sequences are being reported twice in the output files from Kraken2. The headers are not long enough to be truncated, and the sequence headers are not duplicated in the input fastq.gz files. Sometimes, but not always, the reads are classified differently. For example, this is in the output file:
Looking for any guidance on troubleshooting this, or deciding which of the duplicated lines to retain! Thanks.