-
**What's your use case?**
For Statistics when I use the Contains feature "word" for searching specific words, it returns one or more entries and within which document the word is located. Howev…
-
Search box
Autocomplete jquery
https://link.medium.com/yCQ3kBtZXeb
-
As it can be seen in the code sample below, we get different results if
* we pre-process a text with a certain _**text_processor**_ and then create an Example with a data.Field() without a preproces…
-
https://github.com/miso-belica/jusText/blob/dev/justext/stoplists/German.txt
Most of those words are no stop words. For example "Saison", "Jahrhunderts", "Titel" and many more.
-
I really love this library, and it would be awesome, if support for the Danish Wikipedia was added.
What is needed for this to happen?
-
Is it https://github.com/returntocorp/ocaml-tree-sitter-languages or https://github.com/returntocorp/ocaml-tree-sitter-semgrep that tests different Hack repos against their parser?
Wonder if we can…
-
As the official fuzzer implementation provided by golang, the native fuzzer should be well suited for various usage scenarios. However, currently native fuzzers only support general mutation algorithm…
-
## About
At [^1][^2], we shared a few notes about time series anomaly detection, and forecasting/prediction. Other than using traditional statistics-based time series forecasting methods like [Holt…
amotl updated
4 months ago
-
5 types of lemmas:
* dictionary-like: būti
* ne-, negation: nebūti
* be-, continuative: bebūti
* te-, restrictive-: tebūti
* also combinations of prefixes: nebebūti
-
#### Problem description
When using the Phrases model, words and punctuation are treated alike.
While the corpus can be cleaned previously, it will destroy the corpus structure that is useful for …