Open rudaoshi opened 8 years ago
Please check that all model files are in the same directory as the training data. The infer program searches for models in the path given by -input_dir, so if the models were generated somewhere else during training, you should move them to input_dir before running inference.
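In case it helps, here is a minimal sketch of gathering the model files into the inference input directory. The function name `copy_model_files` and the default `*.model` pattern are my own assumptions for illustration, not something LightLDA provides; adjust the patterns to whatever file names your training run actually produced.

```python
import shutil
from pathlib import Path


def copy_model_files(model_dir, input_dir, patterns=("*.model",)):
    """Copy files matching `patterns` from model_dir into input_dir.

    Hypothetical helper: the "*.model" default is an assumption about how
    the trained model files are named. Returns the copied file names.
    """
    src = Path(model_dir)
    dst = Path(input_dir)
    dst.mkdir(parents=True, exist_ok=True)  # create input_dir if missing

    copied = []
    for pattern in patterns:
        for path in sorted(src.glob(pattern)):
            if path.is_file():
                shutil.copy2(path, dst / path.name)  # keeps timestamps
                copied.append(path.name)
    return copied
```

Printing the returned list is a quick way to confirm that the files you expect really ended up in input_dir before you start the infer program.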
I've seen something similar when:
(1) the testing vocabulary has some words with non-zero tf, and (2) those words aren't associated with any topics in the trained model.
This can happen even when the test dataset is the same as the training dataset.
Omitting those words from the corpus resolves the issue and the inferencing results still look OK. On a small data set of ~1000 documents these words were only 0.2% of the vocabulary. I now do this as a preprocessing step in my data pipeline. I'm not sure if this is a legitimate thing to do or not :)
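Roughly, the preprocessing step looks like the sketch below. This is only an illustration under my own assumptions: documents are held in memory as lists of `(word_id, tf)` pairs, and the trained model is available as a dict mapping each word id to its per-topic counts. Neither structure is LightLDA's actual file format, and `drop_words_without_topics` is a made-up name.

```python
def drop_words_without_topics(docs, word_topic_counts):
    """Remove words that occur in the corpus but have no topic in the model.

    docs: list of documents, each a list of (word_id, tf) pairs (assumed layout).
    word_topic_counts: dict word_id -> dict topic_id -> count (assumed layout).
    Returns (filtered_docs, dropped_word_ids).
    """
    # Words that have at least one topic with a positive count in the model.
    has_topic = {
        word_id
        for word_id, topics in word_topic_counts.items()
        if any(count > 0 for count in topics.values())
    }

    filtered = []
    dropped = set()
    for doc in docs:
        kept = []
        for word_id, tf in doc:
            if tf > 0 and word_id not in has_topic:
                dropped.add(word_id)  # non-zero tf but no topic: omit it
            else:
                kept.append((word_id, tf))
        filtered.append(kept)
    return filtered, dropped
```

Comparing the size of the dropped set with the vocabulary size shows how much of the corpus the filter removes.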
HTH!
I have the same problem. I have moved the model to input_dir and the log shows that the model was loaded, but it still does not work.
I have hit the same problem: Fail to build alias row, capacity of row = 0. I have moved block.0, vocab.0, vocab.0.txt, and the trained model into input_dir. Could anyone help?
Hi, I've trained a model with LightLDA and want to infer topics for new documents. However, when I use the infer program, it gives an error: Fail to build alias row, capacity of row = 0. The details are as follows:
Can anyone tell me what has happened here?