-
* Primary packages for this project will be nltk, pandas, numpy, and scikit learn.
* Had no problem reading in a sample 10-k filing as a .txt and tokenizing the text.
* Was also able to create s…
-
Determine the quality of text outputted by pytesseract using natural language processing. This will allow for a measure of how good the output text is, and whether the program should continue on to TT…
-
File "/public/home/acoqh58ab2/miniconda3/envs/torch1.13_py38_dtk23.10/lib/python3.8/site-packages/nltk/downloader.py", line 952, in _update_index
ElementTree.parse(urlopen(self._url)).getroot…
-
**Describe the bug**
I am trying to test a batch/transform job locally on my computer but I am getting the following error at the end of the **transform** method.
"RuntimeError: Failed to run: ['d…
-
Hello
Please I am following this tutorial to create my French Language model : https://github.com/kmario23/KenLM-training
But when I type this cmd :
`bzcat ./data_final/vocabulary.txt.bz2 | pyt…
-
Hi, would it be possible to make the user warnings display only when using pipes that actually depend on these imports? Or at least display them in a way that allows filtering out (with logging packag…
-
Our current tokenizer is... [rather simple](https://github.com/commonsearch/cosr-back/blob/fc134a3cec7b6ffa3169f80d96ce377ca767a5e1/cosrlib/document/__init__.py#L130) :)
Let's discuss what would be r…
-
Hi,
since the command (with the new version of ChatterBot) `python3 manage.py train` is no more supported, how can I train my online bot implemented with Django? I read that I have to create a new py…
ghost updated
2 years ago
-
Hi, Thanks for the repository.
Bdw is there a convenient script to measure the four metrics (BLEU, SARI, FGL, FRE, and DIFF) mentioned in the paper?
BEST
-
Some obvious libraries would include scikit-learn, scikit-image (a nionswift pacakge for this is already in progress), hyperspy, numba, pandas, xarray, matplotlib, seaborn, NLTK, tensorflow, keras, Py…