-
The `docs` object expects (for technical reasons) that all words occur with frequency 1. If words occur several times, they appear several times each with frequency 1.
In the `quanteda` package there…
-
`textmodel_svm()` does not work when the number of documents used to train the classifier exceeds 66,000 on a MacBook Pro with 32GB RAM.
```r
library(quanteda)
#> Package version: 2.0.1
#> Paral…
-
I am having problem with enabling quanteda's parallel computing in R on M3 Macbook. In GitHub, the readme says:
>Windows or macOS users do not have to install TBB or any other packages to enable pa…
-
Given gesistsa/quanteda.proximity#35 and `quanteda::tokens_*()` will not respect `tokenvars`, it would be better to make this a new class for now.
- [x] Create a new class `tokens_with_tokenvars`
…
-
`topicmodels::LDA` is implemented using this library, which I can call directly via Rcpp:
https://sourceforge.net/projects/gibbslda/files/
We can call the library in this way
https://github.…
-
The Arabic has the characters in reverse order (and unconnected).
"word" in Arabic should be:
![image](https://user-images.githubusercontent.com/2182246/56936523-2769d300-6b3b-11e9-9207-147cf05aa…
-
Right now, it's 1, and the token adjacent to it is 2. Seems like these should be 0 and 1.
``` r
library("quanteda")
#> Package version: 4.0.0
#> Unicode version: 14.0
#> ICU version: 71.1
#> P…
-
See https://github.com/quanteda/quanteda/blob/master/R/textstat_readability.R#L22
and https://github.com/unDocUMeantIt/koRpus/blob/master/R/02_method_show.kRp.readability.R
-
I really don't want to install dependencies for functions that I never use. How about taking this on-demand approach?
https://github.com/quanteda/quanteda/blob/36c1ee584e663e155f4e9a27e42645585aeaa…
-
**Issue by [methodds](https://github.com/methodds)**
_Friday Jan 13, 2017 at 12:01 GMT_
_Originally opened as https://github.com/kbenoit/quanteda/issues/452_
----
Hi,
it's really nice to see that …