stepthom / lucene-lda

Using latent Dirichlet allocation (LDA) in Apache Lucene

Add LDAHelper Object to Lucene Index #2

Open doofuslarge opened 11 years ago

doofuslarge commented 11 years ago

Currently, the LDAHelper class (which encapsulates all the LDA functionality) is serialized and written to disk at index time, then read back at query time. This is clumsy: the user must specify a file path for the serialized object at index time and then supply the same path again at query time. It would be simpler (and perhaps cleaner) to store all the information in the LDAHelper class in the Lucene index itself. Is this possible? How can we do this?
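One possible direction, offered only as a sketch: serialize the helper to a byte array instead of a file, so the bytes can travel with the index (for example in a stored binary field; Lucene 4 and later provide a `StoredField` constructor that takes a `byte[]`). The code below shows just the byte-array round trip using the Java standard library. `HelperStandIn` and its fields are hypothetical placeholders, since the real LDAHelper's contents are not shown here, and the Lucene write/read step is deliberately left out.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.io.UncheckedIOException;

final class LdaHelperBytes {

    /** Hypothetical stand-in for LDAHelper; the real class's fields are an assumption. */
    public static class HelperStandIn implements Serializable {
        private static final long serialVersionUID = 1L;
        public final int numTopics;
        public final double[] topicWeights;

        public HelperStandIn(int numTopics, double[] topicWeights) {
            this.numTopics = numTopics;
            this.topicWeights = topicWeights.clone();
        }
    }

    /** Serializes an object to an in-memory byte array rather than a file on disk. */
    public static byte[] toBytes(Serializable obj) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(buffer)) {
            out.writeObject(obj);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        // The stream is closed (and therefore flushed) before the bytes are read out.
        return buffer.toByteArray();
    }

    /** Rebuilds the object from bytes, e.g. bytes read back from the index at query time. */
    public static Object fromBytes(byte[] bytes) {
        try (ObjectInputStream in = new ObjectInputStream(new ByteArrayInputStream(bytes))) {
            return in.readObject();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

At index time the result of `toBytes(helper)` would be written into the index, and at query time the bytes read back would be passed to `fromBytes`, so no separate file path would need to be supplied. Where in the index the bytes should live (a dedicated document, commit metadata, or something else) remains an open design question.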