cmu-lib / text_explorer

Shiny app for exploring multiple corpora with text vector models including topic models, word2vec, doc2vec, keyness
0 stars 0 forks source link
text-analysis

exploravec

Shiny app for exploring subsets foa corpus with different methods, including TF-IDF, term frequencies over time, topic modelling, etc.

Data setup

A persistent sqlite database resides in data/shiny.sqlite3 and contains tables tracking documents, metadata, and various ngrams, and is used by the Shiny app. This is meant to acommodate both documents where we have access to full text, as well as documents where we only have n-gram access, such as those from JSTOR DFR.

The sqlite3 file has been shared on Box with project collaborators and must be downloaded from there in order to run the app locally.