-
Hi,
Pipe operator `mlr3pipelines::PipeOpTextVectorizer` is painfully slow in comparison with `quanteda::dfm()`:
````
library(mlr3)
library(mlr3learners)
library(mlr3pipelines)
library(quante…
-
Hi
In the example, you cannot run this:
yake.cfg["candidate_selection"] = {"ngram": 3}
because it raises `TypeError: unhashable type: 'dict'`.
-
Hey,
I hope you're well.
First, I want you to know that I greatly appreciate your Ngram Type program. I started using your little tool a few months ago to learn the basic patterns to help touc…
-
Hey all,
I have been facing an issue with UniLM V1, where the model has been fine-tuned for a Case correction task. Case correction is the problem where the model figures out which words need to be…
-
to assist with the curation of related content, e.g. as demoed via
https://www.wikidata.org/wiki/Wikidata:University_of_Virginia/Listeria/UVa_people/Common_words_in_titles_of_UVA-coauthored_publicati…
-
Hi,
I've just installed the latest version of textacy in Python 2.7 on a Mac. I am trying to extract topics from a set of comments that contain quite a few non-ASCII characters. The topics I …
-
It seems that users face problems when they work with ngrams: https://stackoverflow.com/questions/46685498/remove-ngrams-with-leading-and-trailing-stopwords
Then, how about making an `ngram` wrapper…
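
As a rough illustration of the filtering discussed in the linked question, here is a minimal Python sketch that builds n-grams and drops those with a leading or trailing stopword. It is not the wrapper proposed above and does not use any particular library's API; the `STOPWORDS` set and both function names are hypothetical, chosen only for this example.

```python
from typing import List, Set, Tuple

# Hypothetical stopword list, for illustration only.
STOPWORDS: Set[str] = {"a", "an", "the", "of", "in", "and", "to"}


def ngrams(tokens: List[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous n-grams of `tokens` as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def ngrams_without_edge_stopwords(
    tokens: List[str], n: int, stopwords: Set[str] = STOPWORDS
) -> List[Tuple[str, ...]]:
    """Keep only n-grams whose first and last tokens are not stopwords."""
    return [
        gram
        for gram in ngrams(tokens, n)
        if gram[0].lower() not in stopwords and gram[-1].lower() not in stopwords
    ]


if __name__ == "__main__":
    tokens = "the house of cards is in the city".split()
    print(ngrams_without_edge_stopwords(tokens, 3))
```

Stopwords inside an n-gram are kept on purpose, so a phrase such as "house of cards" survives while "the house of" does not.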
-
Currently, in many cases the index doesn't match until after the first two chars, mostly because the size of the index explodes. This is probably ok for the first two characters, but not great on the s…
-
つれこむ and 連れ込む can be recognized as one verb, but neither 連れこむ nor つれ込む can.
This behavior seems reasonable as we don't want it to read 沢さん as 沢山, but for verbs especially simple composite verb …
-
1. What are the differences in language, themes, and motifs between the original trilogy, prequel trilogy, and sequel trilogy of Star Wars scripts?
2. What linguistic patterns and rhetorical devices …