n-gram Search Results - Githubissues

1000+ results
for n-gram

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ddangelov/Top2Vec #301

how to get bi-gram and tri-gram and n-gram topic words ?

I remember in LDA and NMF we have configuration parameter called ngram_range where by configuring it as (2,2) or (3,3) we can get topic words as bigrams and trigrams. Is there any such configuration i…

sivachaitanya updated 1 year ago
2
649453932/Chinese-Text-Classification-Pytorch #103

FastText的n_gram_vocab

如果我使用自己的数据，我该如何确定n_gram_vocab 的值呢？

ruarua6666 updated 1 year ago
1
somefood/cs-study #56

[8장] n-gram 알고리즘 질문

> p.[262] 표 질문 질문 : 표에서 입력이 es인데 출력(최종 인덱스 등록)에는 et로 등록되어 있습니다. 이 부분이 왜 이렇게 등록이 되는지 설명해주시면 감사하겠습니다~~!

somefood updated 1 year ago
1
rust-lang/rust #130928

[DESIGN BUG] declarative macros lack of neat way to simulate…

I tried this code: macro match and echo non `const X: Y` pattern is fine. ```rust macro_rules! echo1 { (pub type $ident:ident = $($tt:tt)*) => { pub type $ident = $($tt)*; …

loynoir updated 3 days ago
5
bmschmidt/wordVectors #50

n-grams greater than 2

I was looking to use trigrams because there are significant three-word phrases in my corpus (e.g. "economies in transition" to refer to developing countries). I used the following code in R. statem…

lawest59 updated 6 years ago
1
Thejesh-404/Website_Analyzer #6

n-grams are not sorted

currenty you sort them in ascending order. lets take this for example 1,2,3,4,5,6,7,8,9,10 Now you output the last 5 values 6,7,8,9,10 This will be needed to be sorted again to display it i…

PandaWhoCodes updated 4 years ago
1
microsoft/onnxruntime #4201

StringNormalizer+Tokenizer misses n-grams

**Describe the bug** This is related to issue https://github.com/onnx/sklearn-onnx/pull/485. onnxruntime seems to be missing n-grams if there are stopwords in between. ``ngrams([a b c] , (1, 2)) --> …

xadupre updated 4 years ago
1
giellalt/shared-mul #2

telefonnr-analysator for alle språk

Vi mangler en telefonr-analusator for alle språk. Enten i shared-smi elelr shared-mul. Nå ser det slik ut i lulesamisk, og der blir svenske telefonnr særlig utfordrende da disse får blir "typos" da…

ilm024 updated 2 months ago
4
newtfire/introDH-Hub #104

Mystery Text Discussion: cac.txt

Post your screenshots and discuss your findings about cac.txt here!

ebeshero updated 1 week ago
8
apache/lucene #13802

Should EdgeNGramTokenizer's DEFAULT_MAX_GRAM_SIZE be ONE?

### Description From org.apache.lucene:lucene-analysis-common:9.11.1, the static variable `DEFAULT_MAX_GRAM_SIZE` of EdgeNGramTokenizer is ONE not TWO. Logically, the maximum n-gram size must b…

YeonghyeonKO updated 1 month ago
2

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for n-gram

1000+ results
for n-gram