-
Hi all,
Thanks for the wonderful work. However, I am encountering a problem regarding the output value. For example, when I compare two codes (actual and generated), I get some kind of value like t…
-
https://albertauyeung.github.io/2018/06/03/generating-ngrams.html
N-grams are contiguous sequences of n items in a sentence. N can be 1, 2 or any other positive integer, although usually we do not…
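As a rough illustration of the definition above, here is a minimal sketch that builds word-level n-grams. The function name `generate_ngrams`, the whitespace tokenization, and the lowercasing are assumptions made for this example and are not taken from the linked post.

```python
def generate_ngrams(text, n):
    """Return the contiguous word n-grams of `text` as a list of strings.

    Hypothetical helper: tokenization is a plain whitespace split after
    lowercasing, which is an assumption made for this sketch.
    """
    if n < 1:
        raise ValueError("n must be a positive integer")
    tokens = text.lower().split()
    # Slide a window of width n over the tokens; a text with fewer than
    # n tokens yields an empty list.
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


if __name__ == "__main__":
    print(generate_ngrams("The quick brown fox", 2))
```

A text with fewer than n tokens produces no n-grams at all, which matters for any pipeline that filters on document length before building n-grams.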
-
**Describe the bug**
Larger hit count when adding quotes, e.g.
[british missions south pacific](https://search.library.ualberta.ca/symphony?q=british+missions+south+pacific) gives 318
[british miss…
-
### Discussed in https://github.com/datafuselabs/databend/discussions/3899
Originally posted by **wubx** January 19, 2022
Why we need a full-text index:
1. In APM, a lot of queries look like: where…
-
Appreciate your efforts on this excellent work!
I found that an ngram encoding step runs before hashing in minhash_spark.py. If the length of a doc is below the min_length, then it wil…
-
I would like to do an openalex query for papers (works) while filtering for a list of specific journals. I can fetch the info for `entity = sources` with no problem:
``` r
library(openalexR)
jo…
```
-
The project should be in TypeScript because that would make documentation easier.
-
#### Problem description
I want to calculate the Word Mover's Distance. After normalizing my self-made fastText model (`model.init_sims(replace=True)`), the `wmdistance()` function isn't wor…
-
In a sentence like
"The nuclear magnetic resonance is a physical phenomenon in which nuclei in a strong constant magnetic field are perturbed by a weak oscillating magnetic field."
I would expec…
-
**Describe the improvement**
When searching the publications using the search box, I do not get results when the search term is only part of a word in, e.g., the publication title. For instance, when I sear…