I encountered that with some datasets, the final results of the training phase aren't stored under --directory if I use a random_seed of 13712 while hyper-sampling the concentration parameters as well. Only the file state.log would be produced, but not any other output files.
The program will run smoothly and no error would be raised. However, the output directory would contain only the state.log file and the interim outputs, where we expect also mode.bin, mode-topics.dat and mode-word-assignments.dat. As far as I can tell, the combination of --sample_hyper yes and --random_seed 13712 is causing this fault to occur on selected datasets.
I encountered that with some datasets, the final results of the training phase aren't stored under
--directory
if I use arandom_seed
of13712
while hyper-sampling the concentration parameters as well. Only the filestate.log
would be produced, but not any other output files.To reproduce the problem:
(I used
gensim
to generate the LDA-C corpora)The program will run smoothly and no error would be raised. However, the output directory would contain only the
state.log
file and the interim outputs, where we expect alsomode.bin
,mode-topics.dat
andmode-word-assignments.dat
. As far as I can tell, the combination of--sample_hyper yes
and--random_seed 13712
is causing this fault to occur on selected datasets.