ruidan / Unsupervised-Aspect-Extraction

Code for acl2017 paper "An unsupervised neural attention model for aspect extraction"
Apache License 2.0
335 stars 117 forks source link

Execution order of word2vec and preprocess #26

Closed NovaBurst closed 4 years ago

NovaBurst commented 4 years ago

Hi Ruidan, Thanks for your great work. I have a question of execution order of word2vec.py and preprocess.py if I use other data. As mentioned in README, word2vec should be run first, but after reading the code I found that preprocess reads raw text in dataset folder and generates output to preprocessed_data folder, then word2vec read preprocessed data and generate word embeddings. I wonder whether it is correct to run preprocess first to clean the data and then run word2vec to generate embeddings. Looking forward to your reply. Thanks!

ruidan commented 4 years ago

You are right, you should first execute preprocess.py then word2vec.py. I have changed this in README

NovaBurst commented 4 years ago

Thank you very much for your reply.