Documentation on Methodology

Hi,

Thank you for your interest in the TextRL library! TextRL combines several techniques and ideas, so I can't point to specific papers that correspond to the implementation. Instead, here are some resources related to the general concepts behind it: text generation, reinforcement learning, and fine-tuning pre-trained language models. They might help you understand the techniques and motivation behind TextRL.

https://rl4lms.apps.allenai.org
https://github.com/anthropics/hh-rlhf/tree/master

If you're looking for more specific details, you can check out the documentation and source code of the libraries that TextRL builds upon, such as Hugging Face's Transformers, PFRL, and OpenAI Gym.

voidful / TextRL, issue #18