[Question] Unnecessary Hyperparameter Values for DQN Cartpole-v1

❓ Question

I'm looking at the hyperparameters for DQN for Cartpole here, and am seeing something confusing.

The number of timesteps is set to 50k, but the buffer size is set to 100k. Should the buffer size go to 50k? It shouldn't be a big issue, but it is confusing.

Checklist

[X] I have checked that there is no similar issue in the repo
[X] I have read the SB3 documentation
[X] I have read the RL Zoo documentation
[X] If code there is, it is minimal and working
[X] If code there is, it is formatted using the markdown code blocks for both code and stack traces.

DLR-RM / rl-baselines3-zoo

[Question] Unnecessary Hyperparameter Values for DQN Cartpole-v1 #380

❓ Question

Checklist