-
Hi, thanks for this awesome package!
It works well, but I'm having a problem with one word: my dataset contains the word 'santos' quite often (as in, lower case version of the former Colombian pre…
-
Hi,
I'm interested in hearing if inferSent is suitable to determine semantic similarity between sentences? By averaging the vectors for the sentence and measuring cosine distance for example.
-
Grep begins to be long after 1000 books. For stats on a Query, it could be faster to have an index. SQLite is probably the right choice, usable with different language
-
The as.DocumentTermMatrix method offers a p_attribute parameter.
```
tdm %
partition_bundle(s_attribute = "speaker") %>%
as.TermDocumentMatrix(p_attribute = "lemma", verbose = FALSE)
```
…
-
### Problem
Whilst the original language checker is absolutely brilliant, it fails at small ciphertexts, or those with high entropy. An AI solution would be cool, but would be a bit OTT for rigid dat…
-
As we discussed in Athens, we should fix the types a user can pick from during the creation of the ML model.
In the meeting we ended up with this set of primitive types: (rascal syntax for bnf like…
-
It’s been a while in the planning but 3.9 has moved into final release prep. In this release
* Openchatbot support (client and server)
* Major NLP enhancements including direct support for stemmin…
-
To use either Pie or Freeling,
- [ ] add abstraction layer (function `lemmatise` and `lemmatise_pie`, `lemmatise_freeling`) in Python code.
- [ ] add options `--lemmatise fro` et `lemmatise_engine …
-
Using beta 13.
Tag search is case sensitive, it should not be.
Example:
https://www.yiny.org/search?tagsOneOf=Clip => 1 result
https://www.yiny.org/search?tagsOneOf=clip => no result
-
## How to reproduce the behaviour
After upgrading to spacy 2.1.4 it seems that I have to train my language model again. Training data comes from a (rudimentary, but valid) conll-file,
```
1 Dat - …