-
Hi,
I was wondering if you considered using syncmers (I believe it's already implemented in minimap2) to map the genes between different species? I believe synmcers are specifically meant for highe…
-
Hey it's me again :) I briefly corresponded with @ksahlin on a refined version of closed syncmers with 25-30% lower density for realistic values of k. This might be useful for strobealign.
![optima…
-
Thanks for writing `taxor` and including useful databases!
What the community (or maybe just me) really wants is a database that covers more of the microbial kingdom, but with the benefit of GTDB f…
-
Strobealign currently uses [xxh64](https://github.com/ksahlin/strobealign/blob/8a0e047a451f4ff0ae74988f32695de0a0c80ea4/src/randstrobes.cpp#L38) to hash syncmers.
We could switch to a different, fa…
-
### Background
Randstrobes is a method for linking two or more k-mers together into a seed that can be used for sequence mapping. I intend the below post to be a self-contained rough description of…
-
Dear Wei Shen,
I really like your tool and your tutorials. I just have a question regarding long read metagenomic profiling. Is there a specific parameter combination you would recommend to use to ta…
-
In the following data: [data.tar.gz](https://github.com/user-attachments/files/16457383/data.tar.gz)
there is exactly one single-end short read in `input.fasta` and two sets of haplotypes, `full.fa.g…
-
Hi @eric9n,
I open an additional issue here about using a different sketching algorithm, in addition to minimizer. I attached slide explaining the differences between minimizer and open syncmer and…
-
Hi,
I get the following error when running miniprot looking for 200 proteins at a 73MB genome:
[M::mp_ntseq_read@0.375*1.01] read 68163509 bases in 69053 contigs
[M::mp_idx_build@0.379*1.01] 608218…
-
## Question
Hi Seqan-Team,
I just recently read about XOR Filters and that they were superior to Bloom Filters in regards to memory usage and query time. Do you plan (or maybe already started) t…