Monte Carlo Rollouts in PyTorch?

You can refer to the original TensorFlow implementation by the author https://github.com/LantaoYu/SeqGAN .

Essentially you'll have to change the sample function in the generator to generate conditional samples (i.e. given the first T tokens, generate the remaining tokens). Then you'll have to change the train_generator_PG function in main.py. You'll write a loop in which every iteration appends a new token to the sentence, performs rollouts, and collects rewards for the new token. Good luck!

suragnair / seqGAN

Monte Carlo Rollouts in PyTorch? #4