Open Chandanpanda opened 1 year ago
Hi @Chandanpanda
Thank you for using HandyRL. I'll provide you with some helpful information.
You can find some codes for selecting the model to send to generation.py
inside the worker.py
at the following link, but please note that the code at this time may have differences from the latest version of HandyRL:
https://github.com/YuriCat/TamakEriFever/blob/master/handyrl_core/worker.py#L58-L92
You can refer to the following link for an example of performing both supervised learning and reinforcement learning using the same buffer in HandyRL. Prepare the training data and set the replay_rate
in the config. Load the training data to be used by the Replayer
class and select the replay data to be used for training in _select_record()
:
https://github.com/YuriCat/HandyRL/compare/develop...feature/train_with_replay_same_buffer_simplest
We provide HandyRL as a code base, allowing for customization with minimal code. If you have developed good code, we encourage you to submit a pull request to the HandyRL repository!
This is fantastic information! I will review these. Thanks for the prompt response. Regards,Chandan
Sent from Yahoo Mail on Android
On Fri, Jun 23, 2023 at 21:04, Ikki @.***> wrote:
Hi @Chandanpanda
Thank you for using HandyRL. I'll provide you with some helpful information.
You can find some codes for selecting the model to send to generation.py inside the worker.py at the following link, but please note that the code at this time may have differences from the latest version of HandyRL:
https://github.com/YuriCat/TamakEriFever/blob/master/handyrl_core/worker.py#L58-L92
You can refer to the following link for an example of performing both supervised learning and reinforcement learning using the same buffer in HandyRL. Prepare the training data and set the replay_rate in the config. Load the training data to be used by the Replayer class and select the replay data to be used for training in _select_record():
@.***/train_with_replay_same_buffer_simplest
We provide HandyRL as a code base, allowing for customization with minimal code. If you have developed good code, we encourage you to submit a pull request to the HandyRL repository!
— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you were mentioned.Message ID: @.***>
Many thanks for the great code. I have a question.
How do I insert into the replay buffer? For example, I want to use other agents or random or rule based agents actions=>experience into the replay buffer, especially in the beginning.