-
Hi, I have a quick question about embeddings used during the evaluation.
In README.md, the crosslingual evaluation `python evaluate.py --src_lang en --tgt_lang es --src_emb data/wiki.en-es.en.vec --t…
-
According to the docs:
```
when set to "identical_char" it will use identical character strings between source and target languages to form a vocabulary.
```
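To make the quoted sentence concrete, here is a minimal sketch of what "identical character strings" could mean in practice. It is only an illustration, not the project's actual code: the function name `identical_string_pairs` and the toy vocabularies are hypothetical.

```python
def identical_string_pairs(src_words, tgt_words):
    """Return (word, word) pairs for strings found in both vocabularies.

    Hypothetical helper for illustration only; it is not part of the project.
    """
    tgt_set = set(tgt_words)
    # Keep the source vocabulary order, which is the order the words were given in.
    return [(w, w) for w in src_words if w in tgt_set]


# Toy vocabularies (assumed data, not taken from any real embedding file).
src_vocab = ["the", "taxi", "hotel", "house", "radio"]
tgt_vocab = ["el", "taxi", "hotel", "casa", "radio"]

pairs = identical_string_pairs(src_vocab, tgt_vocab)
print(pairs)
```

Under this reading, only strings spelled the same way in both languages end up as seed pairs, so language pairs with different scripts would share very few entries.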
I understood that the dictionar…
-
Thanks for this wonderful project!
I found that I cannot evaluate on the cross-lingual word similarity task (i.e., the SEMEVAL17 task).
1. in `get_evaluation.sh`, the eval data are `crosslingual/wordsim/$lg…
rgtjf updated
6 years ago
-
Hi, I was trying your method in the unsupervised setting with the en-fr language pair. I trained my embedding model using fastText on newstest 2014. The dictionary sizes for en and fr are 1962 and 2018, respectively.
I do…
-
Hi, @glample
Here is the output from the training run. I found these files (**best_mapping.t7 params.pkl train.log vectors-latin.txt vectors-zh.txt**) in the dumped folder. But how coul…
-
I was trying to build cross-lingual word embeddings for Malayalam and Hindi.
Environment: Ubuntu 16, 8 CPUs/52 GB RAM, Tesla K80, Google Cloud, CUDA 8, Python 3.6, Faiss not installed
This is wh…