-
Is there any way to use multiprocessing with NLTK ngrams?
It seems that NLTK does not come with a function to do this. Is there an example?
km5ar updated
1 month ago
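One possible approach, as a minimal sketch rather than an NLTK feature: spread the per-sentence work across a `multiprocessing.Pool` and call `nltk.util.ngrams` inside each worker. The helper names `sentence_ngrams` and `parallel_ngrams`, the sample corpus, and the `processes`/`chunksize` defaults are assumptions for illustration. The fallback `ngrams` is there only so the sketch runs when NLTK is not installed.

```python
from functools import partial
from multiprocessing import Pool

try:
    from nltk.util import ngrams  # NLTK's own n-gram generator
except ImportError:
    # Fallback (assumption): same output for plain token lists without padding.
    def ngrams(sequence, n):
        tokens = list(sequence)
        return zip(*(tokens[i:] for i in range(n)))


def sentence_ngrams(tokens, n):
    """Return the n-grams of one tokenized sentence as a list of tuples."""
    # Materialize the generator: results must be picklable to be sent
    # back from the worker process.
    return list(ngrams(tokens, n))


def parallel_ngrams(sentences, n, processes=2, chunksize=1000):
    """Compute n-grams for many tokenized sentences with a process pool."""
    worker = partial(sentence_ngrams, n=n)
    with Pool(processes=processes) as pool:
        # map() preserves input order; chunksize batches sentences per task
        # to reduce inter-process overhead.
        return pool.map(worker, sentences, chunksize=chunksize)


if __name__ == "__main__":
    corpus = [["the", "quick", "brown", "fox"], ["hello", "world"]]
    print(parallel_ngrams(corpus, 2))
```

Because each n-gram computation is cheap, the pool only tends to pay off on large corpora; for small inputs the cost of pickling sentences between processes can outweigh the gain.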
-
**Describe the bug**
Calling jaccard_index on long strings leads to `OverflowError: CUDF failure at: /opt/conda/conda-bld/work/cpp/include/cudf/detail/sizes_to_offsets_iterator.cuh:323: Size of outpu…
-
Code at https://github.com/sanskrit-lexicon/CORRECTIONS/tree/master/ngram
Bigrams not found in the base dictionary but found in any of the test dictionaries are listed below.
Trigrams are on the way.
# Big…
-
![ngrams](https://user-images.githubusercontent.com/4312244/154011999-9d3c63d6-dbc9-4ced-954b-8293fbd58891.gif)
-
Does this model take care of ngrams like "hot dog" = "hotdog", "ice cream" = "icecream"?
I have these ngrams in my training data.
Also, what if I want to remove words which are not corrected and…
-
Hi,
I have been trying for months to download [ngrams-en-20150817.zip](https://languagetool.org/download/ngram-data/ngrams-en-20150817.zip), but it fails every time at around 2-3 GB with no possibi…
-
https://impresso-project.ch/app/search/ngrams?sq=CgQIARgC&unigrams=fukushima,chernobyl,tschernobyl
{ "route": [ "search", "find" ], "message": "ResourceRequest timed out", "code": 500, "name": "Gen…
-
It would be cool to have an option to define a group of letters to subset the selection with, thus allowing a layout-agnostic way of quickly specifying custom training for only a single hand, certai…
-
Version 24.2.3.70, index used:
```
CREATE TABLE t
(
`tenant` String,
`recordTimestamp` Int64,
`responseBody` String,
`colAlias` String ALIAS responseBody || 'something else',
…
```
-
We should also test whether n-grams have the proper class.