Kyubyong / wordvectors

Pre-trained word vectors of 30+ languages
MIT License
2.22k stars 393 forks source link

Training specification for pretrained model #24

Open maxmarketit opened 4 years ago

maxmarketit commented 4 years ago

Hello, First of all, thank you for the pre-trained model. Since there are many ways to train a fasttext model for Korean, I am curious about how you trained your model and which corpus you used.

For example, fasttext can be trained with corpus first morpheme-analyzed or not, fasttext has several hyperparameters like window size, n-gram size, and what not.