-
**Is your feature request related to a problem? Please describe.**
I'd like to perform various distance/similarity functions.
- [ ] `cosine`
- [ ] `dot_product`
- [ ] `euclidian`
- [ ] `pearson`…
-
Jaccard similarity of all pairwise combinations of variants.
Extract the set of mutations for each variant, then compare pairwise across all variants.
There are a couple of interesting cases:
…
-
```
We can compare strings fuzzily and we can compare Jaccard similarity.
It would be nice to combine the two, using fuzzy string matching to determine
equivalence of strings to feed into the Jaccar…
-
"If you like test-retest reliability of task fMRI, the MID task or ABCD data, you may* like our Stage 2 Reg. Report submitted [@PeerCommunityIn](https://x.com/PeerCommunityIn)
. Overview w/ all the f…
-
```
We can compare strings fuzzily and we can compare Jaccard similarity.
It would be nice to combine the two, using fuzzy string matching to determine
equivalence of strings to feed into the Jaccar…
-
I've done quite a bit of googling recently on this as, at only 20MB it is becoming really slow, especially when using Jaccard similarity.
My proposed solution is to use something like the MinHash a…
-
The check for null case should be done at the token bag level rather than the string level:
https://github.com/OlivierBinette/StringCompare/blob/be58f4c1c9c24bc2cef5d9bb81053fa7ea003792/stringcompa…
-
When I use `hlarp` to compare HLA type from different samples for the same patient, I get output for each sample name separately. Is there a way to force `hlarp` to ignore the sample description in th…
-
Turned native: true, and Recommendify would fail silently w/ Jaccard similarity instead of throwing some error when native extensions are failing. (if not built, I do get a proper error message: recom…
-
As I was thinking about the ANI stuff #1967 https://github.com/sourmash-bio/sourmash/issues/2001 I came up with an idea. 💡
right now, search outputs largely useless CSV files, with minimal informat…