-
Hi. I have written an abstraction layer around multiple libraries doing word splitting (`londonisacapitalofgreatbritain` must become `london is a capital of great britain`). All the libs rely on prepr…
-
E.g. if I want sequences of integers, with ngrams appended to the end?
-
The reworked NGRAMS Documentation (the GitHub Wiki is temporary) should include request examples for at least curl, JavaScript, TypeScript, Python, and NodeJS.
A tool to map curl commands to other …
-
Your n_gram_creator works fine how you have it, but here is another way to write it using a foreach loop in case you want to take a look:
```
def ngram_creator(text_list):
ngrams = []
…
-
I am trying to use KernelShap with Pytorch and I am getting an error when using it that says:
```AssertionError: Unknown type passed as data object: ```
Here is the model and predict function: …
-
**Describe the bug**
The `str.character_ngrams` function produces token `` for strings which are lesser than the provided `n` (shown in image for the case of bigrams).
![result output](https://githu…
-
Could we perhaps add some shortcuts to the individual jmdictdb entry pages for checking the ngrams for all kanji and readings? Maybe not for everyone but at least for loggged-in editors?
For exampl…
-
As discussed by @szhengac https://github.com/dmlc/gluon-nlp/pull/529#discussion_r255815817, the classification script does not follow the paper. No word-ngram hashing is used.
leezu updated
5 years ago
-
Another way to write the loop using a for each instead of a standard for loop would be:
```
def ngram_creator(text_list):
ngrams = []
lastword = None
for word in text_list:
…
-
I try to read the source code。
```golang
func newRecognizer_8859_2(language string, ngram *[64]uint32) *recognizerSingleByte {
return &recognizerSingleByte{
charset: "ISO-8859-2",
h…