SLDA always produce the same response variable estimate for all the documents

bab2min / tomotopy

Python package of Tomoto, the Topic Modeling Tool

https://bab2min.github.io/tomotopy

MIT License

557 stars 62 forks source link

SLDA always produce the same response variable estimate for all the documents #208

Open zli12321 opened 1 year ago

zli12321 commented 1 year ago

I have a size 13000 dataset with 20 categories and trained SLDA with those labels with K=16. After training, I first call the infer function, then estimate to predict a document from the corpus. For every document trained for the model, the estimate result always returns 19 as the argmax category. Are there any bugs with supervised LDA code?