LantaoYu / SeqGAN

Implementation of Sequence Generative Adversarial Nets with Policy Gradient

How to train and generate from our own Data? #3

Closed ghost closed 8 years ago

ghost commented 8 years ago

Can you provide any guidance on how to train and generate from my own data? I would like to try SeqGAN with various English poetry and prose, but I am not sure how to change this code to train on my own data and then generate new writing.

LantaoYu commented 8 years ago

Our code runs in a synthetic environment: we first initialize an oracle model and use it to generate the real data. In a real scenario, you can skip both the oracle-building step and the real-data generation step and directly load the English poetry as the real data. Specifically, at https://github.com/LantaoYu/SeqGAN/blob/master/MLE_SeqGAN/sequence_gan.py#L155 we use the oracle to generate the real data, which is a set of sequences of numbers (indices). You can skip this step, and https://github.com/LantaoYu/SeqGAN/blob/master/MLE_SeqGAN/sequence_gan.py#L156 will then load the given real data, i.e. the poetry sequences. Note that you should also remove the original evaluation function and use your own evaluation metric if necessary.
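Since the real data is described above as sequences of indices, raw poetry has to be turned into integer sequences before it can be loaded. The following is only a rough sketch of one way to do that, not code from the repository. The helper names (`build_vocab`, `encode`, `decode`, `write_sequences`), the `<pad>`/`<unk>` tokens, whitespace tokenization, and the output format (one space-separated, fixed-length sequence per line) are all assumptions; check the sequence length, vocabulary size, and file format against the data loader in the repository before relying on it.

```python
from collections import Counter

# Hypothetical special tokens; the repository does not define these.
PAD, UNK = "<pad>", "<unk>"


def build_vocab(lines, max_size=5000):
    """Map the most frequent whitespace tokens to integer indices."""
    counts = Counter(tok for line in lines for tok in line.lower().split())
    words = [w for w, _ in counts.most_common(max_size - 2)]
    itos = [PAD, UNK] + words            # index -> token
    stoi = {w: i for i, w in enumerate(itos)}  # token -> index
    return stoi, itos


def encode(line, stoi, seq_length=20):
    """Turn one line of text into exactly seq_length indices (truncate or pad)."""
    ids = [stoi.get(tok, stoi[UNK]) for tok in line.lower().split()]
    ids = ids[:seq_length]
    return ids + [stoi[PAD]] * (seq_length - len(ids))


def decode(ids, itos):
    """Turn generated indices back into text, dropping padding."""
    return " ".join(itos[i] for i in ids if itos[i] != PAD)


def write_sequences(lines, stoi, path, seq_length=20):
    """Write one space-separated index sequence per line (assumed file format)."""
    with open(path, "w") as f:
        for line in lines:
            if line.strip():  # skip blank lines
                ids = encode(line, stoi, seq_length)
                f.write(" ".join(map(str, ids)) + "\n")


if __name__ == "__main__":
    poems = ["Shall I compare thee", "thee to a summer day"]
    stoi, itos = build_vocab(poems)
    write_sequences(poems, stoi, "real_data.txt")  # hypothetical output path
    print(decode(encode(poems[0], stoi), itos))
```

Keeping `itos` around matters for the "generate" half of the question: whatever the generator samples is again a sequence of indices, and `decode` is what maps it back to words. The model's vocabulary size setting would also need to match `len(itos)`.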

Crista23 commented 6 years ago

@jpcreamer @LantaoYu Could you please post the code for training on a custom dataset?