Open cdgiv opened 6 years ago
Can you elaborate on this part of the code? https://github.com/geek-ai/irgan/blob/master/ltr-gan/ltr-gan-pointwise/ltr_gan_d_nn_g_nn.py#L125-L129
I am trying to understand where the importance sampling came from - I do not see it mentioned anywhere in the papers for IRGAN, REINFORCE, GANs, RankNet, etc.
Is this just from Monte Carlo sampling? If so, can you provide a good reference?
Thanks!
I also have this question
It's a trick widely used in RL. You can search for the keyword: off-policy, importance sampling. Hope it can help you!
Can you elaborate on this part of the code? https://github.com/geek-ai/irgan/blob/master/ltr-gan/ltr-gan-pointwise/ltr_gan_d_nn_g_nn.py#L125-L129
I am trying to understand where the importance sampling came from - I do not see it mentioned anywhere in the papers for IRGAN, REINFORCE, GANs, RankNet, etc.
Is this just from Monte Carlo sampling? If so, can you provide a good reference?
Thanks!