-
Hey lucas! If you're going to use a list of frequent bigrams/trigrams anyways, you don't need to do the first part where you generate all sane permutations of the input string, right?
That is, if …
-
Hi Maarten,
I think there is a bug in the OpenAI representation model in the way the prompt is generated. The keywords are only separated by a space, not a comma, which is problematic for n-grams >…
-
The LanguageTool wiki [describes how to use n-gram data to detect additional error types](http://wiki.languagetool.org/finding-errors-using-n-gram-data), and provides n-gram data for this purpose. Is …
-
### Your current environment
The startup command is as follows: it initiates both a standard 7B model and an n-gram speculate model. Speed tests discover that the speculate model performs more slowl…
-
Sadly, mongoid full text seach appears to be the only option available. The choice of Mongoid was purely experimental, for learning purposes, but I'm finding that I'm somewhat regretting it.
Bear in…
-
hi there,
when I use minhash with lsh or simhash, it's hard to remove short text. anybody could provide some useful method to solve this problem, thanks a ton!
take below example, and dive…
-
Awesome work! I already practiced n-grams on several platform with the ones I extracted from text corpora myself.
Would it be possible to have a tag in order to chose a language that specifies th…
-
The way JMdict currently works makes it hard to correctly interpret the frequency of certain forms.
This was also (kinda) discussed in #113.
Let's take 曲がりなりにも for example:
Form | N-grams | %
--…
-
Hi every one, do we have a shallow fusion implementation with N-gram Language Model (e.g Kenlm or Srilm) instead of Neural Language Model? If not, can you give me some instructions to do shallow fusio…
-
用textcnn,显示没有n_gram_vocab