Closed JappyPing closed 8 months ago
Hi, thank you for reaching out.
As far as I see there are two sources:
@eseiler and @smehringer This seems to be much more your field of expertise than mine. What do you think about this?
Independent of the documentation.
A k-mer over dna5 will require at least log_2(5) ≈ 2.3219 bits, which means shapes with max 27 ones are possible. If you changed it to dna4, you could go up to 32 ones inside your shape.
Independent of the documentation.
A k-mer over dna5 will require at least log_2(5) ≈ 2.3219 bits, which means shapes with max 27 ones are possible. If you changed it to dna4, you could go up to 32 ones inside your shape.
Thanks so much for the explanation. I now have a better understanding.
Hi,
Many thanks for this excellent library in computational biology. I have a question when I was using the API. Please see my question below.
Platform
Question
I wanted to hash each kmer of a sequence using the hash method in Seqan3. The used kmer size is 28, so the shape is also 28. but
"terminate called after throwing an instance of 'std::invalid_argument' what(): The chosen shape/alphabet combination is not valid. The alphabet or shape size must be reduced."
However, I read the documentation; it says the maximum size of the shape is 58.
So, is all my understanding correct? or is this a bug?
Thanks for your time.
Best regards,
Jappy