-
It looks like it is not taking into account the unidentified minhashes that are subtracted from the query sketch before entering into `GatherResult`.
this may be the source of the difference notice…
-
When I run `multisearch`, using the code shown below, the number of output matches varies depending on the sample (for example, 0 matches, 760 matches, and 165 matches). It is my understanding that th…
-
I am working on this pangenome database idea at https://github.com/ctb/2022-database-covers/. I may have found an error in the taxonomic classification while trying to figure some stuff out with the s…
-
see https://github.com/apetkau/genomics-data-index/issues/39 (linked to https://github.com/sourmash-bio/sourmash/issues/137) which reports:
```
sourmash.exceptions.Panic: sourmash panicked: \
thr…
-
Protein k-mer sizes are k=k*3 internally, but k in a manifest.
In`signature::Select`, we account for this discrepancy like this:
```
valid = if let Some(ksize) = selection.ksize() {
let k =…
-
**Submitting Author:** Tessa Pierce-Ward (@bluegenes)
**All current maintainers:** @ctb, @luizirber, @bluegenes
**Package Name:** sourmash
**One-Line Description of Package:** `sourmash` is a comma…
-
Working with data from [https://frl.publisso.de/data/frl:6425521/marine/short_read/marmgCAMI2_sample_0_reads.tar.gz](here) as the sample data, both `yacht train` and `yacht run` run into a similar err…
-
Dear FracMinHash team,
My understanding of FracMinHash is that a much smaller number of hash functions can be used for estimating containment Jaccard and then Jaccard index (after obtaining the car…
-
I recently had this use case: I have a large database of say phages, and I want to know if they are contained in my query genome.
Ideally, I'd like to index them in an SBT and then search using con…
-
I'm on day 3 of a gather of the Hu S1 dataset against all genbank (500k+ genomes), and chatting with @taylorreiter about the find-the-unassigned script that @luizirber wrote,
https://github.com/ta…