Closed BenAhn closed 5 years ago
Hey! Are you using the default processing in load_20newsgroups.py? I have been messing with the code a lot recently...so some changes may cause end results to fluctuate.
For example, you shouldn't have tokens like "/./lib" showing up. If you aren't using my parameters/preprocessing, can you tell me what you are using instead?
As for the loss - it does that. It's not meant to decrease after a certain point.
As for the version issues - As noted in the main description of this repo, this code is written for Tensorflow 1.5. I should have also specified I used Python 3.5 too. Either way...thank you for letting me know that this works on 3.6.8 w/ tf 1.12.0. Very helpful!!
Additional note on the loss...it's going to stay the same after only a few epochs, but the topics will continue to learn. So, the loss staying the same does not mean that it isn't continuing to learn. It is just constantly taxing the model at the same level.
Thank you for prompt reply!
can you tell me what you are using instead?
the loss staying the same does not mean that it isn't continuing to learn
again thank you~
How does the algorithm learn:
Sana and I had the same issue with strange and sometimes unprintable characters (unicode?)
Closing, however I might make a new issue related to the preprocessing issue. The "loss decreasing" part of this issue is not an issue. +
How does the algorithm learn:
- P(T,W) - probability of a topic given word, starting from the prior dirichlet distribution, without a decreasing loss function?
@dbl001 it learns by making the Dirichlet distribution (the initializer of the topic matrix) more and more sparse over time. Sort of related...here is a question on stackoverflow showing how dirichlet distributions can be manipulated to become more and more sparse.
Thank you for reviewing.
I will wait for your preprocessing updates. and if i have a progress i will share.
From who really want to see LDA2vec is working. ^^
Dear. Mr. Nateraw
First of all thank you for your tensorflow LDA2vec code.
I have tested your code with suggested data, and followed your instruction of installation
After that, i got the result as below. It started loss with 3381 and finished with 3381.
Question
Thank you in advance.
PS - The code is not working with Python 3.7.1, tensorflow-gpu 1.13.1 but python 3.6.8 with tensorflow-gpu 1.12.0 works fine