dselivanov / text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
http://text2vec.org

Definitions of pos_remove characters? #279

Open kmeeker opened 6 years ago

kmeeker commented 6 years ago

I would like to know what all the abbreviations mean. Some I can guess, like "PUNCT", but I have no idea what "X" might be. I want to retain contractions, but it is hard to choose options without documentation.

Thanks. Great performance code!

dselivanov commented 6 years ago

Please consult http://universaldependencies.org/u/pos/ for the tag definitions.
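
For quick reference, here is a minimal base-R sketch of the 17 universal POS tags described on that page, with a short gloss for each ("X" is "other", used for tokens that cannot be assigned a real category). The `pos_remove_example` vector is a made-up removal list for illustration only, not the package default, and nothing here calls text2vec itself.

```r
# Universal POS (UPOS) tags and short glosses, following the
# Universal Dependencies tag set linked above.
upos_tags <- c(
  ADJ   = "adjective",
  ADP   = "adposition",
  ADV   = "adverb",
  AUX   = "auxiliary",
  CCONJ = "coordinating conjunction",
  DET   = "determiner",
  INTJ  = "interjection",
  NOUN  = "noun",
  NUM   = "numeral",
  PART  = "particle",
  PRON  = "pronoun",
  PROPN = "proper noun",
  PUNCT = "punctuation",
  SCONJ = "subordinating conjunction",
  SYM   = "symbol",
  VERB  = "verb",
  X     = "other"
)

# Hypothetical removal list, for illustration only (not the package default).
pos_remove_example <- c("PUNCT", "SYM", "X")

# Look up what each tag in the removal list means.
print(upos_tags[pos_remove_example])

# Tags that would survive if only the example list were removed.
kept_tags <- setdiff(names(upos_tags), pos_remove_example)
print(kept_tags)
```

On contractions: in the English UD treebanks, pieces such as "n't" are usually tagged PART and clitic auxiliaries such as "'ll" are usually tagged AUX, so keeping contractions likely means leaving PART and AUX out of `pos_remove`. That depends on the tagging model in use, so it is worth checking against the tagger's actual output.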