dselivanov / text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
http://text2vec.org

Guide to supervised text classification or categorisation using text2vec #285

Closed PriyankaS1489 closed 5 years ago

PriyankaS1489 commented 5 years ago

Can you please help me with the steps? I have got as far as the text2vec word vectors and collocations. Now I run LSA, but how do I apply it to the specific categories I have? I have email contents that need to be routed to 12 departments. I have data that is already classified and can be used for training. Can you guide me, please?

dselivanov commented 5 years ago

http://text2vec.org/vectorization.html
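As a rough illustration of the kind of pipeline being asked about, here is a minimal sketch that goes from raw email text to a sparse document-term matrix, then TF-IDF, then LSA, and finally a classifier. The toy emails, the department labels, and the choice of 2 LSA dimensions are all hypothetical. The `randomForest` package is an assumption too: it is not part of text2vec, so that step is guarded and only runs if the package is installed.

```r
library(text2vec)

# Hypothetical toy data: email texts and the department each belongs to.
emails <- c("invoice payment overdue for last month",
            "refund request for a duplicate payment",
            "question about an invoice and tax amount",
            "payment failed while paying the invoice",
            "password reset link does not work",
            "account locked after failed login",
            "cannot login to the vpn from my laptop",
            "laptop needs a password reset")
dept <- factor(c("billing", "billing", "billing", "billing",
                 "it", "it", "it", "it"))

# Tokenize and build the vocabulary-based vectorizer.
it <- itoken(emails, preprocessor = tolower, tokenizer = word_tokenizer,
             progressbar = FALSE)
vocab <- create_vocabulary(it)
vectorizer <- vocab_vectorizer(vocab)

# Sparse document-term matrix (dgCMatrix), one row per email.
dtm <- create_dtm(it, vectorizer)

# TF-IDF weighting; the result is still sparse.
tfidf <- TfIdf$new()
dtm_tfidf <- fit_transform(dtm, tfidf)

# LSA reduces the sparse matrix to a small number of dense columns.
lsa <- LSA$new(n_topics = 2)
doc_emb <- fit_transform(dtm_tfidf, lsa)

# Assumption: randomForest is installed. It takes the LSA output as x
# and the known departments as y.
if (requireNamespace("randomForest", quietly = TRUE)) {
  rf <- randomForest::randomForest(x = doc_emb, y = dept, ntree = 100)
  print(rf)
}
```

The design choice here is to give the random forest the low-dimensional LSA output rather than the sparse TF-IDF matrix itself. New emails would need to pass through the same fitted vectorizer, TF-IDF and LSA models before prediction.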

Wed, 14 Nov 2018, 18:57 PriyankaS1489 notifications@github.com:

Can you please help me with the steps? I have got as far as the text2vec word vectors and collocations. Now I run LSA, but how do I apply it to the specific categories I have? I have email contents that need to be routed to 12 departments. I have data that is already classified and can be used for training. Can you guide me, please?

PriyankaS1489 commented 5 years ago

I have already done all these steps up to building the TF-IDF, but that creates a dgCMatrix. I also need to run LSA. How do I use it with a random forest?

PriyankaS1489 commented 5 years ago

Hi, can you tell me how to load a dgCMatrix into a random forest?

dselivanov commented 5 years ago

I think you are confusing the purpose of the issue tracker on GitHub. It is primarily for issue reporting and feature discussion. I suggest you search for tutorials and spend some time learning the basics of NLP.