IntuitionEngineeringTeam / chars2vec

Character-based word embeddings model based on RNN for handling real world texts
Apache License 2.0
171 stars, 37 forks

Training the model without target values #7

Closed: phume closed this issue 4 years ago

phume commented 4 years ago

Hi, I came across your article on hackernoon.com (Chars2vec: character-based language model for handling real world texts with spelling errors and…). It is very interesting.

I am wondering whether I can train the chars2vec model without target values (that is, as an unsupervised model). Basically, I have a list of names and want to vectorize it. So far I have been using TF-IDF from sklearn to vectorize these names, and then doing some plotting, clustering, and cosine-similarity comparisons on the resulting vectors.

Thanks.
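
For context, the TF-IDF workflow described above might look roughly like the sketch below. It is only an illustration, not code from the chars2vec repository: the sample names, the character n-gram range, and the number of clusters are all assumptions made for the example.

```python
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Hypothetical sample data; replace with the real list of names.
names = [
    "Jonathan Smith",
    "Jonathon Smith",
    "Jon Smith",
    "Maria Garcia",
    "Mariah Garcia",
    "Marya Garcia",
]

# Character n-grams (2 to 3 characters, taken inside word boundaries)
# so that small spelling differences still share most features.
# The n-gram range is an assumption, not a recommended setting.
vectorizer = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 3))
vectors = vectorizer.fit_transform(names)  # sparse matrix, one row per name

# Pairwise cosine similarity between all names, as a dense (n, n) array.
similarity = cosine_similarity(vectors)

# Cluster the names; n_clusters=2 is chosen only to fit the sample data.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(vectors)

for name, label in zip(names, labels):
    print(label, name)
```

No target values are involved anywhere in this pipeline. Any other matrix with one embedding row per name could replace `vectors` in the similarity and clustering steps.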