TommyJones / tidylda

Implements an algorithim for Latent Dirichlet Allocation using style conventions from the [tidyverse](https://style.tidyverse.org/) and [tidymodels](https://tidymodels.github.io/model-implementation-principles/index.html).
Other
41 stars 3 forks source link

augment() does not respect columns passed to it #67

Closed TommyJones closed 2 years ago

TommyJones commented 2 years ago

augment() has two issues:

  1. Any additional columns in data are dropped in the output, requiring the user to re-merge
  2. augment() expects columns named "term" and "document". Better to specify the name of the "term" and "document" columns, rather than having users rename them.
TommyJones commented 2 years ago

https://github.com/TommyJones/tidylda/commit/34da3d59c67ed40b2b72e0e081bd15663e93879b