-
# Dynamic N-Grams task
I will gather all the progress on the Dynamic N-Grams task in this issue and will likely update it regularly, so you may want to unsubscribe from this issue …
-
As far as I understand, there is no way to use n-grams with the stm package, and I haven't found any discussion on this topic.
Is that correct? And if so, is there a practical or theoretical (or…
-
In #771 I tested the effects of reducing the distillation data to understand that expensive part of our pipeline. However, we should do it again for the `base` student model, as the other one was done…
-
Using the `FreqDist` and `ConditionalFreqDist` classes from NLTK, build the unigram, bigram, and trigram models for both words and tags.
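To make the counting step concrete, here is a minimal sketch using only the standard library; NLTK's `FreqDist` is a `Counter`-like class and `ConditionalFreqDist` maps a condition to a `FreqDist`, so plain `Counter` and `defaultdict` stand in for them here. The tagged sentence is illustrative, not from the actual corpus.

```python
from collections import Counter, defaultdict

# Illustrative tagged sentence (hypothetical data); in practice this
# would come from the tagged training corpus.
tagged = [("the", "DT"), ("dog", "NN"), ("barks", "VBZ")]
words = [w for w, _ in tagged]
tags = [t for _, t in tagged]

def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Unigram, bigram, and trigram counts for words and for tags
# (what FreqDist would hold per order).
word_models = {n: Counter(ngrams(words, n)) for n in (1, 2, 3)}
tag_models = {n: Counter(ngrams(tags, n)) for n in (1, 2, 3)}

# Conditional counts (previous word -> next-word counts), the shape
# a ConditionalFreqDist built from bigram pairs would have.
cond_word = defaultdict(Counter)
for w1, w2 in ngrams(words, 2):
    cond_word[w1][w2] += 1
```

The same `cond_word` construction applies to tags, and the conditional tables are what a backoff or HMM-style tagger would consume downstream.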
-
-
After processing wikipedia with the fixes as of `274293f3af97c507416f6387020507ee99ca3238`, the tail of the DocFreqTable has a lot of n-grams:
~~~
724ddeaf8cb3c269,1,0,1.93455e-07,Vasilije Veljko …
~~~
-
**Description**: Develop tests to verify the correctness of each function, including text preprocessing and trigram generation.
**Checklist**:
- [ ] Research testing strategies for NLP models, esp…
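As a starting point for the checklist above, a sketch of what such unit tests could look like. The function names (`preprocess`, `make_trigrams`) are hypothetical placeholders for the project's actual functions; the assertions illustrate the kinds of properties worth checking.

```python
def preprocess(text):
    # Placeholder implementation: lowercase and whitespace-tokenize.
    return text.lower().split()

def make_trigrams(tokens):
    # Placeholder implementation: contiguous trigrams as tuples.
    return [tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)]

def test_preprocess_lowercases_and_tokenizes():
    assert preprocess("The Quick Fox") == ["the", "quick", "fox"]

def test_trigram_count_and_order():
    tokens = ["a", "b", "c", "d"]
    grams = make_trigrams(tokens)
    # n tokens yield n - 2 trigrams, in corpus order.
    assert len(grams) == len(tokens) - 2
    assert grams[0] == ("a", "b", "c")

def test_short_input_yields_no_trigrams():
    # Edge case: fewer than 3 tokens should not raise.
    assert make_trigrams(["a", "b"]) == []
```

These run unchanged under pytest's test discovery; edge cases (empty input, punctuation handling in `preprocess`) are the usual next additions.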
-
https://github.com/MaartenGr/KeyBERT/blob/6ab9af1cfe74a126e709539a2467426d0881945c/keybert/_highlight.py#L94
This line should be `skip = skip - 2`.
aucan, updated 2 years ago
-
**Describe the bug**
This is related to PR https://github.com/onnx/sklearn-onnx/pull/485. onnxruntime seems to drop n-grams when there are stopwords in between. ``ngrams([a b c] , (1, 2)) --> …
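To make the expected behavior concrete, a standard-library sketch of n-gram enumeration with range `(1, 2)` (the input tokens here are illustrative): over `[a, b, c]` this should yield three unigrams and two bigrams, and if a middle stopword such as `b` is removed first, the bigram over the remaining tokens (`"a c"`) should still be formed; the reported bug is that such n-grams go missing.

```python
def ngrams(tokens, ngram_range):
    """Enumerate all n-grams with n in the inclusive ngram_range."""
    lo, hi = ngram_range
    return [" ".join(tokens[i:i + n])
            for n in range(lo, hi + 1)
            for i in range(len(tokens) - n + 1)]

# Full token sequence: unigrams a, b, c plus bigrams "a b", "b c".
full = ngrams(["a", "b", "c"], (1, 2))

# After removing the stopword b, the bigram "a c" is still expected.
no_stop = ngrams(["a", "c"], (1, 2))
```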
-
Some counts are off by 2 to 3% in version 1.9.3:
```
> x=ngram(c("der","die","der die", "der+die","der die + die"), corpus = "de-2019", smoothing=0, count=TRUE)
> x
# Ngram data table
# Phrase…