Hi,
Thanks for the awesome work, I meet some questions on preprocess the data.
I use your release data in nc-v11, but find no "nc-v11.tok_le50.json" and "vectors.en.st" file on the release data.
I try to use the dual_to_seq/data scrips to generate the json file and use the pre-trained embedding file. are this the true preprocess methods on the "dual2seq" model?
Thank you very much!
Hi, Thanks for the awesome work, I meet some questions on preprocess the data.