-
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
I want to generate parquet files that efficiently support prefix, substring, and suffix q…
-
Use the bigrams and trigrams of 'kAzikA', 'bAlamanoramA', or another grammar text as a baseline, and flag unusual bigrams and trigrams as candidates for correction.
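A minimal sketch of that idea, assuming character-level n-grams and a plain list of reference words. The helper names, the `min_count` threshold, and the sample words are all hypothetical and are not taken from any existing tool:

```python
from collections import Counter


def char_ngrams(word, n):
    """Return all contiguous character n-grams of `word`."""
    return [word[i:i + n] for i in range(len(word) - n + 1)]


def build_reference_counts(reference_words, sizes=(2, 3)):
    """Count every bigram and trigram seen in the reference text."""
    counts = Counter()
    for word in reference_words:
        for n in sizes:
            counts.update(char_ngrams(word, n))
    return counts


def odd_ngrams(word, reference_counts, sizes=(2, 3), min_count=1):
    """Return n-grams of `word` seen fewer than `min_count` times in the reference."""
    return [
        gram
        for n in sizes
        for gram in char_ngrams(word, n)
        if reference_counts[gram] < min_count
    ]


# Hypothetical reference words standing in for a real grammar text.
reference = ["rAmaH", "rAmau", "rAmAH"]
counts = build_reference_counts(reference)
print(odd_ngrams("rAmxH", counts))
```

A word whose n-grams all occur in the reference yields an empty list, so only words containing at least one unseen (or rare) n-gram are flagged for review.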
-
## ❓ Questions and Help
**Description**
Hi,
I saw in the torchtext documentation that the function 'ngrams_iterator' now uses yield; when you call 'build_vocab_from_iterator' you still have 'yield ngr…
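For context, here is a plain-Python sketch of a generator that yields tokens followed by their n-grams. It is a stand-in written for illustration, not torchtext's own code; the names `ngrams_sketch` and `corpus_iterator` are hypothetical, and the output order (unigrams first, then longer n-grams joined by spaces) is an assumption:

```python
def ngrams_sketch(tokens, max_n):
    """Yield each token, then every n-gram (n = 2..max_n) joined by spaces."""
    for token in tokens:
        yield token
    for n in range(2, max_n + 1):
        for start in range(len(tokens) - n + 1):
            yield " ".join(tokens[start:start + n])


def corpus_iterator(lines, max_n):
    """Yield one list of n-gram strings per input line (hypothetical helper)."""
    for line in lines:
        # Materialize the generator so each yielded item is a list of strings.
        yield list(ngrams_sketch(line.split(), max_n))


for ngram_list in corpus_iterator(["here we are"], 2):
    print(ngram_list)
```

Because `ngrams_sketch` is a generator, a caller that needs a list per document has to wrap it in `list(...)`, as `corpus_iterator` does here.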
-
Hi @m4rc1e! I was just about to start trying this out and I see that there isn't an N'ko wordlist. Wondering if one could be created. Fortunately, there is a tool that is being used to collect N'ko wo…
-
Ubuntu 18.04
tensorflow 2.2.0
tfx 0.21.4
I am generating a vocabulary from my own dataset, which contains 28 GB of TFRecords with short description strings (up to 20 words) and integer labels from 1 to 100.
…
-
```
To ensure that the same ngram features are always generated, we use
meta collectors that collect the frequencies of all ngrams and then select the
top k as features.
The ngram annotator…
-
This will be a page that describes the data about the current spotteds.
It should be updated every ~10 min.
Features can vary from:
- [ ] Last Spotted Approved by API
- [ ] Last Spotted Approved by a…
-
I am trying to replicate the return value of `starspace_embedding()` function. Here is what I have found so far.
When training a model with ngrams = 1, `starspace_embedding(model, 'word1 word2')` …
-
I am curious about the rationale for replacing consecutive whitespace characters with just a single space character for [`CountVectorizer(analyzer='char')`](https://github.com/scikit-learn/scikit-learn/blob/51a765a/…
yxtay updated
2 years ago
-
```
I was looking at the DKPro functionality, and I saw this page about how to work
with NGrams in DKPro
http://code.google.com/p/dkpro-core-asl/wiki/WorkingWithNGrams
While we have all of this func…