adjidieng / ETM

Topic Modeling in Embedding Spaces
MIT License
540 stars 127 forks

Run ETM on my own dataset #30

Open EJ0917 opened 3 years ago

EJ0917 commented 3 years ago

Hello, is it possible to know what this data looks like: "raw/new_york_times_text/nyt_docs.txt"? I am trying to fit my own dataset but don't know which format I should convert it to ... I hope somebody can help!

Thanks!

lxkkk117 commented 2 years ago

I also ran into this problem. Have you solved it? Could you please tell me how to do it? Thank you!

manueltonneau commented 2 years ago

Since this repo doesn't seem to be maintained anymore, I suggest you use another repo I just discovered, which let me run ETM very easily: https://github.com/lffloyd/embedded-topic-model
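
For anyone who would rather stay with this repo, here is a minimal sketch of preparing a file in the same spirit as "raw/new_york_times_text/nyt_docs.txt". It assumes the file is plain UTF-8 text with one document per line, which is not confirmed anywhere in this thread, so check the preprocessing script before relying on it. The helper names `write_docs_file` and `read_docs_file` and the file name `my_docs.txt` are hypothetical and only for illustration.

```python
from pathlib import Path


def write_docs_file(docs, path):
    """Write documents to a text file, one document per line.

    ASSUMPTION: the target format is one document per line.
    Whitespace inside a document (including newlines) is collapsed to
    single spaces so each document stays on its own line.
    Returns the number of documents written.
    """
    lines = [" ".join(doc.split()) for doc in docs]
    lines = [line for line in lines if line]  # drop empty documents
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)


def read_docs_file(path):
    """Read the file back as a list of documents, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [line.strip() for line in f if line.strip()]


if __name__ == "__main__":
    # Hypothetical example corpus and output file name.
    my_docs = [
        "The first document.\nIt spans two lines in the raw source.",
        "The second document.",
    ]
    count = write_docs_file(my_docs, "my_docs.txt")
    print(f"wrote {count} documents")
```

If the assumption holds, pointing the repo's preprocessing script at the resulting file instead of the NYT file should be the only change needed, but that part is untested here.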

Idan-Garay commented 1 year ago

@manueltonneau were you able to get predictions for test_dataset with the model from that repo?