graceblackwell closed this issue 8 months ago
Hi @graceblackwell, as a temporary workaround you can safely derive the percentage of matched kmers by dividing the number of kmers present by (length of query - length of kmer + 1).
The cobs index search does not account for duplicated kmers in the query, i.e., it does not deduplicate the query kmers before searching. This differs from bigsi, in which the query kmers are deduplicated. For example, if the query is AAAA and the kmer length is 3, the cobs index search reports 2 matched kmers for AAA, while the bigsi index search reports 1 matched kmer.
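To make the arithmetic concrete, here is a minimal Python sketch of the workaround. The helper names `kmers` and `matched_fraction` are hypothetical and are not part of cobs or bigsi; `num_matched` is assumed to be the per-sample kmer count reported by the cobs search.

```python
def kmers(query: str, k: int) -> list[str]:
    """Return every overlapping kmer of the query, duplicates kept."""
    return [query[i:i + k] for i in range(len(query) - k + 1)]


def matched_fraction(num_matched: int, query_len: int, k: int) -> float:
    """Fraction of query kmers matched, counting duplicates as cobs does."""
    total = query_len - k + 1  # number of kmers in the query, duplicates included
    if total <= 0:
        raise ValueError("query is shorter than the kmer length")
    return num_matched / total


query, k = "AAAA", 3
all_kmers = kmers(query, k)    # duplicates kept: two copies of AAA
unique_kmers = set(all_kmers)  # deduplicated: one copy of AAA

print(len(all_kmers), len(unique_kmers))
# Assumed count of 2 for a sample that contains AAA, as in the example above.
print(matched_fraction(2, len(query), k))
```

Because the duplicates are counted in both the numerator and the denominator, the ratio stays consistent with what the cobs search reports. Dividing by the number of unique kmers instead would only be appropriate for a tool that deduplicates the query first.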
Could the percentage/proportion of query kmers present in each sample be reported rather than the number of kmers present?