Closed Ram81 closed 1 year ago
Hi,
I wanted to refer to the implementation of the RL fine-tuning approach proposed in the paper, but I wasn't able to find the training code or instructions for it. Can someone point me to them?
Hey. Unfortunately, only the trained agent parameters, the data, and examples of how to run the agent were shared. The RL training code was not shared.