I am having some trouble understanding GenomeTester4 version 4.2.16 (stable) functionality, notably with the kmer frequency thresholds. For example, I have two datasets A and B and am interested in kmers that are unique for each:
So far, so good: the list has only those kmers that are in either A or B but not both. However, kmers that occur only once could very well represent sequencing errors. therefore, I want to have a list of unique k-mer that occur at least twice. I assume this can be done using the cutoff frequency. However, this does not work as expected as the second-last kmer occurs only once:
I am having some trouble understanding GenomeTester4 version 4.2.16 (stable) functionality, notably with the kmer frequency thresholds. For example, I have two datasets A and B and am interested in kmers that are unique for each:
So far, so good: the list has only those kmers that are in either A or B but not both. However, kmers that occur only once could very well represent sequencing errors. therefore, I want to have a list of unique k-mer that occur at least twice. I assume this can be done using the cutoff frequency. However, this does not work as expected as the second-last kmer occurs only once:
Likewise, when I specify a minimum frequency when querying the files, this does not have any effect:
Is this a bug or am I missing something?
Thanks