-
I'm not sure how lightweight you want `stellar` to be, but `quanteda` is a fairly heavy dependency, itself bringing in `ggplot2`, various `Rcpp` packages, half the tidyverse, parallel packages, and ma…
-
Configure script fails to find `opeTBB`, despite `RcppParallel` using it.
```
---> Building R-seededlda
xinstall: mkdir /opt/local/var/macports/build/_opt_PPCSnowLeopardPorts_R_R-seededlda/R-seede…
-
Parallelisation on Linux, macOS, and Windows now works, but users need to have TBB installed.
Currently, the [README](https://github.com/quanteda/quanteda) lists these installation instructions un…
-
Hello,
I think there is an error in the way the LSA is computed in `textmodel_lsa.R`. After the svd decomposition, the `v` singular vectors are usually weighed with the `k` remaining singular value…
-
Compare with the Python implementation, I is not (1/k) / 10000.
Need to reimplement Yule's I.
-
Hello!
I am trying to lemmatize my German language tokens - any hints on how I could do so? E.g. packages to use (optimally in combination with quanteda)?
I'd greatly appreciate any help!
Son…
-
Training any `word2vec()` model fails on Fedora 37 with the binary from the [`iucar/cran` COPR repository](https://copr.fedorainfracloud.org/coprs/iucar/cran/). I first [reported the problem there](ht…
-
To use an older script I am learning about the 3.0 version of `quanteda` and its companion packages.
I see that `textplot_xray.R` raises the following warning:
> Use of `x$ntokens` is discourage…
-
Seems useful to have `as_dfm` for quanteda package as this gains popularity as a data structure solution in R. People will expect to be able to convert to this format easily and quanteda provides mec…
-
# Text Analysis in R
Text analysis is a scientific subspace that is not served well by general tidyverse / datascience tools. It is a focus area of rOpenSci and we have interesting tools and expert…