Closed ghost closed 6 years ago
Hi, I'm wondering what does the Japanese training data look like. Are they segmented by word or by character? also the data for training word2vec, are they segmented in the same way?
In the same way of English dataset, Japanese training data are also segmented by word.
I used pre-trained word embeddings.
Hi, I'm wondering what does the Japanese training data look like. Are they segmented by word or by character? also the data for training word2vec, are they segmented in the same way?