Closed: judembo closed this issue 6 years ago
Unfortunately, the implementation only supports semi-supervised training with both source and target monolingual data. As a workaround, you can supply dummy source monolingual data and remove everything related to trans_x (line 132) in binmt.py. The source monolingual data will then be ignored.
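If it helps, here is a minimal sketch of how the dummy source-side file could be generated. This is not part of THUMT: the helper `write_dummy_corpus`, the file name `dummy_src_mono.txt`, and the placeholder token are all hypothetical, and how many lines the trainer expects (or whether the token has to be in your source vocabulary) is an assumption you should check against your own configuration.

```python
import os
import tempfile


def write_dummy_corpus(path, num_lines=1, token="dummy"):
    """Write a placeholder monolingual corpus: one token per line.

    Hypothetical helper, not a THUMT function. The content is never
    meant to be trained on; it only has to exist so the data loader
    has something to read.
    """
    with open(path, "w") as f:
        for _ in range(num_lines):
            f.write(token + "\n")
    return path


if __name__ == "__main__":
    # Assumed output location; point this wherever your config expects
    # the source monolingual corpus.
    out = os.path.join(tempfile.gettempdir(), "dummy_src_mono.txt")
    write_dummy_corpus(out, num_lines=4)
    print("wrote", out)
```

You would then point the source monolingual corpus setting at this file and apply the trans_x removal described above.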
Thank you, looks like I got it running.
I'm trying to use semi-supervised training on the theano branch, but with monolingual data from the target language only (no monolingual data from the source language). Cheng et al. (2016) do the same, but THUMT seems to require monolingual data from both languages. Am I missing something, or is there an easy way to do this without changing a lot of code? Thank you!