Open hanuganu opened 4 years ago
In the paper, you mention verifying the baseline reward approach and that the baseline reward estimator was pretrained. However, I couldn't find the pretraining code. Could you help me with the pretraining code?