openbigdatagroup / plda

PLDA: Parallel Latent Dirichlet Allocation in C++
http://openbigdatagroup.github.io/plda
Apache License 2.0
85 stars 31 forks source link

Is the topic distribution computation in LDASampler::GenerateTopicDistributionForWord correct? #1

Closed GoogleCodeExporter closed 8 years ago

GoogleCodeExporter commented 8 years ago
It looks like the distribution computation in 
LDASampler::GenerateTopicDistributionForWord has:

    distribution->push_back(
        (topic_word_factor + beta_) *
        (document_topic_factor + alpha_) /
        (global_topic_factor + num_topics * beta_));

But based on Algorithm 3.1 in 
http://www.datalab.uci.edu/papers/fastLDA_2008.pdf, shouldn't the 
denominator in the value above be (global_topic_factor + num_words * 
beta_)?

Original issue reported on code.google.com by ritz.kar@gmail.com on 14 Mar 2009 at 7:37

GoogleCodeExporter commented 8 years ago
Thanks Ritz.  You are right.  We are correcting this defect.

Original comment by Yi.Wang.2005 on 16 Mar 2009 at 6:29

GoogleCodeExporter commented 8 years ago

Original comment by baihong...@gmail.com on 16 Mar 2009 at 6:34