seededlda is an R package that implements Seeded LDA (Latent Dirichlet Allocation) for semi-supervised topic modeling based on quanteda. Initially, the package was a simple wrapper around the topicmodels package, but it was fully rewritten in C++ using the GibbsLDA++ library and submitted to CRAN as version 0.5 in 2020. The package was further developed to add the sequential classification (Sequential LDA) and parallel computing (Distributed LDA) capabilities and released as version 1.0 in 2023.
keyATM is the latest addition to the semi-supervised topic models. The users of Seeded LDA are also encouraged to download that package.
From CRAN:
install.packages("seededlda")
From Github:
devtools::install_github("koheiw/seededlda")
Please visit the package website for examples:
Please read the following papers on the algorithms.
Please read the following papers for how to apply seeded-LDA in social science research: