stepthom / lucene-lda

Using latent Dirichlet allocation (LDA) in Apache Lucene
58 stars 23 forks source link

Handle the case of no input file codes #5

Open doofuslarge opened 11 years ago

doofuslarge commented 11 years ago

If the filecodes option is not set (and hence no filename->integer mapping is provided by the user), we need to create an identity mapping that can be used in the query results. (I.e., instead of outputing (fileCode, relevancyScore) tuples in the output, we should just output (fileName, relevancyScore) tuples.)