Closed sashafrey closed 10 years ago
After https://github.com/sashafrey/topicmod/commit/be50848f4e41956366c351ab28c013667257447d perplexity calculation uses document unigram model. This is an "optimistic" estimate. An option to use collection unigram model will give a "pesimistic" estimate, which is better.
Fixed by https://github.com/sashafrey/topicmod/commit/189c9b4f808e313a37e6e5e7826c0f37850fef7c
After https://github.com/sashafrey/topicmod/commit/be50848f4e41956366c351ab28c013667257447d perplexity calculation uses document unigram model. This is an "optimistic" estimate. An option to use collection unigram model will give a "pesimistic" estimate, which is better.