susanli2016 / NLP-with-Python

Scikit-Learn, NLTK, Spacy, Gensim, Textblob and more
2.74k stars 2.02k forks source link

Doubt regarding data pass to saved model #32

Open sandeshchand opened 3 years ago

sandeshchand commented 3 years ago

Dear sir, This is simple question but i am not sure about the result.I am working on topic model and using lda algorithm which is unsupervised learning algorithm.I have total 120 documents.i have divided documents into 100 and 20.Firstly, i passed 100 documents to th model and store the result.Then i saved the model and dictionary. Now this saved model is used to predict the topics of 20 documents and got the result.

as we know saved model is used to predict the unseen data. My question is can i pass first 100 documents (documents is used during saving the model)together with 20 documents(unseen document) to the saved model? Is it possible? will it effect the model performance.