-
In NLTK.metrics aline there's a function that returns distance between a segment of aligned strings.
The problem is when the algorithm aline the strings using aline.aline it is possible to have two vo…
-
`KneserNeyInterpolated.generate()` takes too long to run.
Consider the following example:
```python
from nltk.corpus import brown
from nltk.lm.preprocessing import padded_everygram_pipeline
f…
-
https://github.com/Unstructured-IO/unstructured/blob/01dbc7b4733e88efd6c1e85930c707009a2a966e/unstructured/nlp/tokenize.py#L101-L113
Should prob use the cache here instead of on tokenizers:
`@lru_…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
### What is the issue?
I followed the guide on OSX, I don't think it is a OS problem, and it didn't worked, it was not able to create the config yaml files because there are two folders with the name…
-
****Error:** Error while finding module specification for 'cog.server.http' (ModuleNotFoundError: No module named 'cog')**
Not able to use run command inside build section in cog.yaml
Below is m…
-
Hey there! Super cool project. Thought I'd add some of the (yet to be documented) steps that I took to get the application working on my macbook pro with an M1 chip.
I did not use the docker image …
-
**Description**
punkt-tab should be used for tokenization across the repo and validator repos.
**Why is this needed**
as of nltk 3.8.2, punkt is deprecated in favor of punkt-tab.
**Implementat…
-
I'm having some problems running CFG-Embed, and I can't find the collocations.tab file,How can I get to that file.Can you help me?thanks!
![009ded2e31b6f2382a0c67d64f4492b](https://github.com/user-at…
-
Trying to use the Dockerfile and get the following error:
```
---> Running in 8bb1dcf92b93
/usr/local/lib/python3.9/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_do…