Closed sunsidazzz closed 7 years ago
The RNN WGAN model is vastly inferior to an RNN trained with the MLE objective. See for example the last page of "Exploring the Limits of Language Modeling" where the results are much better than the ones we obtain. Also, as you state, the training time with MLE is much faster.
This was the first (along with a simultaneous paper published by MILA) work to show that you can train an RNN with just the GAN objective and without MLE. This is just a first step, and we haven't reached the performance of MLE models yet. We believe that if more effort will be put into this research direction we could get to, or even pass, the performance of MLE models with GAN models.
If you have more questions feel free to email any of the authors.
This paper claims that it produces better results than MLE models https://arxiv.org/abs/1801.07736
So I am researching in language models that can generate words.
Your results showed that RNN + GANs improves the quality of generated sequences compared to CNN+GANs. But do you think that the combination of RNN and GANs performs better than an independent RNN model?(LSTM, GRU) Because the resulting sentences you got from RNN+GANs are not coherent, and a well-trained RNN model can do the same job. Also, when I was training your model, I feel like using CL+VL+TH is very time-consuming. So is it really worth to train an RNN with GANs? Or the purpose of this project is just to prove that RNN could work well with GANs?
Thanks, Sida Sun