cemoody / lda2vec

MIT License
3.15k stars 628 forks source link

No module named unicode #55

Open SidrahJunaid opened 7 years ago

SidrahJunaid commented 7 years ago

Im using python 3.5 and when im trying to run processor.py its showing an error No module named unicode.Can anyone help me how to fix it?

blairdev1023 commented 7 years ago

I'm not entirely sure if this is the issue but in the source code there's a spot that converts the cleaned text to unicode. In Python 3, the command unicode() was deprecated and just put into str(). Perhaps you could just edit the source code in the conda files (I installed lda2vec with anaconda). Or you could also run your script in a Python 2 environment.

As a side note, I'd really suggest that the author start writing this module in Python 3 and not 2. My experience in the NLP world is that EVERYONE uses 3 precisely because it handles unicode errors so much easier. In fact if I wasn't so terribly fascinated with NLP I probably wouldn't have transitioned over to Python 3.