ngrams Search Results - Githubissues

1000+ results
for ngrams

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

scikit-learn/scikit-learn #16017

TfidfVectorizer ngrams does not work when vocabulary provide…

#### Description The `TfidfVectorizer` does not honor the `ngram_range` argument when the `vocabulary` is provided. #### Steps/Code to Reproduce Example 1, vocabulary is *not* provide…

tgsmith61591 updated 2 years ago
9
jkomoros/card-web #429

Fingerprints don't include important single-word ngrams

Related to #399 and #417 and #401. There's a problem where important words that should be focused on aren't showing up in fingerprints. It's actually getting worse as there are more concept cards w…

jkomoros updated 3 years ago
1
saweaver/pwp-capstones #1

n_gram_creator

Your n_gram_creator works fine how you have it, but here is another way to write it using a foreach loop in case you want to take a look: ``` def ngram_creator(text_list): ngrams = [] …

ad3429 updated 5 years ago
1
pytorch/text #655

Ability to pass custom tokenizer to text_classification data…

## 🚀 Feature The current `text_classification` sets all use the "basic english" tokenizer which cannot be changed. I would propose that a `tokenizer` argument is added to `_csv_iterator` and `_setup_…

bentrevett updated 4 years ago
1
combinatorist/math-and-puzzles #4

Google ngrams: Markov Model and document reconstruction

The google ngrams show frequency over time of certain ngrams (n-length phrases of words). It would be interesting to use this to create a massive markov model to generate sentences typical of differe…

combinatorist updated 8 years ago
2
smilli/kneser-ney #2

ValueError: math domain error

I changed the pad_symbol as left_pad_symbol, right_pad_symbol and add start_pad_symbol in KneserNeyLM, but there still another eroor. We may use log function with a negative value,but why it was neg…

wateryouyou updated 1 year ago
2
laurensw75/kaldi_egs_CGN #5

Mal-formed spk2gender

Hi, I'm a beginner in Kaldi, and I ran into the above issue when executing make_mfcc.sh for the train_s folder. I checked the file using head and tail, but it looked fine to me with sorted utt-id o…

JeromeNi updated 4 years ago
9
in617/Codecademy-Capstone-Murder-Mystery #1

n_gram_creator

https://github.com/in617/Codecademy-Capstone-Murder-Mystery/blob/master/Murder%2BMystery-Copy6.py#L169-L173 Your n_gram_creator works fine how you have it, but here is another way to write it using…

ad3429 updated 5 years ago
1
reidpr/quac #109

error parsing TSV: 'utf-8' codec can't decode byte 0xed in p…

I have some really simple code to search the tweets for ngrams I've built (using Python 3.4): ``` python #!/usr/bin/env python import sys import argparse import json import re argparser = argparse.…

gfairchild updated 8 years ago
1
arangodb/arangodb #7825

ArangoSearch STARTS_WITH Prefix Matches not working (+ nGra…

## My Environment * __ArangoDB Version__: 3.4.0-RC.6 * __Storage Engine__: RocksDB * __Deployment Mode__: Single Server * __Deployment Strategy__: Docker Swarm * __C…

JonDum updated 3 years ago
8

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for ngrams

1000+ results
for ngrams