Open DavidLudl opened 1 month ago
Then during the training, do I have to choose batch size with n × T ? Or any other batch size is also OK?
What algorithm are you using? I guess you are talking about on policy algorithm (A2C/PPO) and the n steps parameter?
In that case, you can use any number of steps (this parameter can impact performance though).
I am using PPO. They parameter I want to ask is the batch_size. Should I set the batch_size (default 64) in PPO algorithm to n*T ?
The mini batch size can be kept as is, it is only about the gradient step.
Thank you, now I understand.
❓ Question
Hello.
I would like to ask if I have a finite MDP, where each episode has a same fixed timestep \$T$. Then during the training, do I have to choose batch size with \$n\times T$\? Or any other batch size is also OK?
Thank you for your time,
Best regards,
Checklist