-
#### Description
The `TfidfVectorizer` does not honor the `ngram_range` argument when the `vocabulary` is provided.
#### Steps/Code to Reproduce
Example 1, vocabulary is *not* provide…
-
Related to #399 and #417 and #401.
There's a problem where important words that should be focused on aren't showing up in fingerprints. It's actually getting worse as there are more concept cards w…
-
Your n_gram_creator works fine how you have it, but here is another way to write it using a foreach loop in case you want to take a look:
```python
def ngram_creator(text_list):
    ngrams = []
    …
```
-
## 🚀 Feature
The current `text_classification` sets all use the "basic english" tokenizer which cannot be changed. I would propose that a `tokenizer` argument is added to `_csv_iterator` and `_setup_…
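To make the proposal concrete, here is a minimal sketch of an iterator that accepts a pluggable tokenizer. It is not the library's code: `csv_token_iterator` and the stand-in `basic_english` function are hypothetical names, and the sketch assumes each CSV row holds a label in the first column followed by text.

```python
import csv
import io
from typing import Callable, Iterable, Iterator, List


def basic_english(text: str) -> List[str]:
    # Stand-in default tokenizer (assumption): lowercase, split on whitespace.
    return text.lower().split()


def csv_token_iterator(
    lines: Iterable[str],
    tokenizer: Callable[[str], List[str]] = basic_english,
) -> Iterator[List[str]]:
    # Hypothetical helper: yields the tokens of each row's text columns.
    # The first column is assumed to be the label and is skipped.
    for row in csv.reader(lines):
        text = " ".join(row[1:])
        yield tokenizer(text)


data = io.StringIO('1,"Hello World"\n2,"N-grams are fun"\n')

# Default behaviour: the built-in stand-in tokenizer is used.
default_tokens = list(csv_token_iterator(data))

# Caller-supplied tokenizer: here, a character-level split via list().
data.seek(0)
char_tokens = list(csv_token_iterator(data, tokenizer=list))
```

Passing the tokenizer as a keyword argument with a default keeps existing callers working while letting new callers swap in their own function.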
-
The google ngrams show frequency over time of certain ngrams (n-length phrases of words).
It would be interesting to use this to create a massive markov model to generate sentences typical of differe…
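As a rough illustration of the idea, the sketch below builds a first-order Markov model from bigram counts and samples a sentence from it. The counts in `toy_counts` are invented placeholders, not real Google Ngram data, and `build_model` and `generate` are hypothetical names.

```python
import random
from collections import defaultdict
from typing import Dict, List, Tuple


def build_model(
    bigram_counts: Dict[Tuple[str, str], int]
) -> Dict[str, List[Tuple[str, int]]]:
    # Map each word to the words that follow it, with their counts.
    model = defaultdict(list)
    for (first, second), count in bigram_counts.items():
        model[first].append((second, count))
    return dict(model)


def generate(
    model: Dict[str, List[Tuple[str, int]]],
    start: str,
    max_words: int,
    rng: random.Random,
) -> str:
    # Walk the chain, picking each next word in proportion to its count.
    words = [start]
    while len(words) < max_words and words[-1] in model:
        successors, weights = zip(*model[words[-1]])
        words.append(rng.choices(successors, weights=weights, k=1)[0])
    return " ".join(words)


# Invented toy counts (assumption), standing in for one time slice of n-gram data.
toy_counts = {
    ("the", "cat"): 3,
    ("the", "dog"): 1,
    ("cat", "sat"): 2,
    ("dog", "sat"): 2,
}

model = build_model(toy_counts)
sentence = generate(model, "the", 3, random.Random(0))
```

Building one such model per time period would let the generated sentences reflect the phrase frequencies of that period.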
-
I changed pad_symbol to left_pad_symbol and right_pad_symbol and added start_pad_symbol in KneserNeyLM, but there is still another error. We may be calling the log function with a negative value, but why was it neg…
-
Hi, I'm a beginner in Kaldi, and I ran into the above issue when executing make_mfcc.sh for the train_s folder.
I checked the file using head and tail, but it looked fine to me with sorted utt-id o…
-
https://github.com/in617/Codecademy-Capstone-Murder-Mystery/blob/master/Murder%2BMystery-Copy6.py#L169-L173
Your n_gram_creator works fine how you have it, but here is another way to write it using…
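For reference, a minimal sketch of an n-gram builder written with an explicit loop. It assumes the input is a list of tokens. The name `make_ngrams` and its signature are illustrative, not the code from the linked file.

```python
from typing import List, Tuple


def make_ngrams(tokens: List[str], n: int = 2) -> List[Tuple[str, ...]]:
    # Illustrative helper: slide a window of size n across the token list.
    if n < 1:
        raise ValueError("n must be at least 1")
    ngrams = []
    for start in range(len(tokens) - n + 1):
        ngrams.append(tuple(tokens[start:start + n]))
    return ngrams


bigrams = make_ngrams("the quick brown fox".split(), n=2)
# bigrams == [("the", "quick"), ("quick", "brown"), ("brown", "fox")]
```

If the list is shorter than `n`, the range is empty and the function returns an empty list.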
-
I have some really simple code to search the tweets for ngrams I've built (using Python 3.4):
``` python
#!/usr/bin/env python
import sys
import argparse
import json
import re
argparser = argparse.…
```
-
## My Environment
* __ArangoDB Version__: 3.4.0-RC.6
* __Storage Engine__: RocksDB
* __Deployment Mode__: Single Server
* __Deployment Strategy__: Docker Swarm
* __C…