OLC-Bioinformatics / ConFindr

Intra-species bacterial contamination detection
https://olc-bioinformatics.github.io/ConFindr/
MIT License

Trimmed or Untrimmed reads? #23

Closed: vappiah closed this issue 3 years ago

vappiah commented 3 years ago

Dear developers,

I would like to identify contamination in my Salmonella genomes. What is the recommended way to run ConFindr: before or after trimming the raw paired-end reads?

adamkoziol commented 3 years ago

Hi Vincent,

We don't trim reads before processing with ConFindr. As part of its workflow, ConFindr by default only counts a base if its quality score is at least 20. However, if you have particularly low-quality reads and/or high sequencing depth, you might find that trimming reduces the number of false positive calls.
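To make the idea of a per-base quality cutoff concrete, here is a minimal Python sketch. It is not ConFindr's actual code: the function names `phred_scores` and `bases_passing_cutoff` are hypothetical, and it assumes the standard Phred+33 FASTQ encoding. It only shows how bases below a score of 20 would be left out of a count.

```python
def phred_scores(quality_string, offset=33):
    """Decode a FASTQ quality string into integer Phred scores.

    Assumes Phred+33 encoding (an assumption of this sketch).
    """
    return [ord(ch) - offset for ch in quality_string]


def bases_passing_cutoff(sequence, quality_string, cutoff=20):
    """Return (position, base) pairs whose Phred score is >= cutoff."""
    if len(sequence) != len(quality_string):
        raise ValueError("sequence and quality string must be the same length")
    scores = phred_scores(quality_string)
    return [
        (pos, base)
        for pos, (base, score) in enumerate(zip(sequence, scores))
        if score >= cutoff
    ]


if __name__ == "__main__":
    # Illustrative read: scores decode to 40, 40, 20, 2, 40, 10
    seq = "ACGTAC"
    qual = "II5#I+"
    print(bases_passing_cutoff(seq, qual))
```

In this toy read, the bases at positions 3 and 5 fall below the cutoff and are ignored, which is why a per-base filter can stand in for trimming in many cases.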

Best, A

vappiah commented 3 years ago

Thanks @adamkoziol