-
```python
import sys
import gzip
from Bio import SeqIO
import sourmash
from sourmash import MinHash
import concurrent.futures
def create_sourmash_sketch(genome, ksizes, scale):
# Create …
-
I am working on this pangenome database idea at https://github.com/ctb/2022-database-covers/. I may have found an error in the taxonomic classification while trying to figure some stuff out with the s…
-
It looks like it is not taking into account the unidentified minhashes that are subtracted from the query sketch before entering into `GatherResult`.
this may be the source of the difference notice…
-
When I run `multisearch`, using the code shown below, the number of output matches varies depending on the sample (for example, 0 matches, 760 matches, and 165 matches). It is my understanding that th…
-
**Submitting Author:** Tessa Pierce-Ward (@bluegenes)
**All current maintainers:** @ctb, @luizirber, @bluegenes
**Package Name:** sourmash
**One-Line Description of Package:** `sourmash` is a comma…
-
Protein k-mer sizes are k=k*3 internally, but k in a manifest.
In`signature::Select`, we account for this discrepancy like this:
```
valid = if let Some(ksize) = selection.ksize() {
let k =…
-
see https://github.com/apetkau/genomics-data-index/issues/39 (linked to https://github.com/sourmash-bio/sourmash/issues/137) which reports:
```
sourmash.exceptions.Panic: sourmash panicked: \
thr…
-
Working with data from [https://frl.publisso.de/data/frl:6425521/marine/short_read/marmgCAMI2_sample_0_reads.tar.gz](here) as the sample data, both `yacht train` and `yacht run` run into a similar err…
-
Dear FracMinHash team,
My understanding of FracMinHash is that a much smaller number of hash functions can be used for estimating containment Jaccard and then Jaccard index (after obtaining the car…
-
I recently had this use case: I have a large database of say phages, and I want to know if they are contained in my query genome.
Ideally, I'd like to index them in an SBT and then search using con…