datquocnguyen / LFTM

Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
Other
178 stars 59 forks source link

Can LFTM run the Chinese corpus? #2

Closed lianrzh closed 8 years ago

datquocnguyen commented 8 years ago

YES, it can.

I just tried (for the very first time) getting a Chinese word segmented corpus (from http://alias-i.com/lingpipe/demos/tutorial/chineseTokens/read-me.html).

And LFTM can run on this corpus with the word vectors which are learned by Word2Vec.