I have read your paper and I'm trying to replicate it using MovieLens.
From the text it wasn't entirely clear to me whether you also optimize the UserRequestEncoder separately and if that is the case, would you use the terminal (minibatch) loss for that?
So in the training-loop you would optimise as follows:
Hi there,
I have read your paper and I'm trying to replicate it using MovieLens. From the text it wasn't entirely clear to me whether you also optimize the UserRequestEncoder separately and if that is the case, would you use the terminal (minibatch) loss for that?
So in the training-loop you would optimise as follows:
where theta is the forward model, phi is the flow model and ure is the user-request encoder?
So far I have not been able to train the model with great success (used my own implementation). But I'm wondering if this is what causes it.
Thanks in advance!