arnaudstiegler / gameNgen-repro

Apache License 2.0

Training Timesteps for Stable Baselines3 Agent and Data Collection Process for Diffusion Model #1

Closed martintomov closed 5 days ago

martintomov commented 1 month ago

Hi,

I'm currently training a stable_baselines3 agent on ViZDoom and came across your gamengen_test_dataset on Hugging Face. I have a couple of questions regarding your setup:

  1. Training Timesteps: How many timesteps did you use to achieve your current results? So far, I've experimented with 1,000,000 and 3,000,000 timesteps.

  2. Data Collection for Diffusion Model: Are you collecting data during the RL agent's training for later diffusion model training, or do you only start data collection after the RL agent is fully trained to ensure the gameplay data is more "human-like"?

Looking forward to your insights.
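For context on the timestep question: in stable_baselines3, `model.learn(total_timesteps=...)` budgets individual environment steps rather than episodes, so 1,000,000 timesteps spans many episodes. Below is a minimal pure-Python sketch of that budgeting logic, using a hypothetical `DummyEnv` stand-in (a real run would use stable_baselines3 with a ViZDoom gym wrapper, neither of which is shown here):

```python
import random

class DummyEnv:
    """Hypothetical stand-in for a ViZDoom gym wrapper: fixed-length episodes."""
    def __init__(self, episode_len=50):
        self.episode_len = episode_len
        self.t = 0

    def reset(self):
        self.t = 0
        return 0  # dummy observation

    def step(self, action):
        self.t += 1
        done = self.t >= self.episode_len
        return 0, 0.0, done  # obs, reward, done

def train_for(total_timesteps, env):
    """Mimics how SB3's model.learn(total_timesteps=...) counts env steps, not episodes."""
    steps = episodes = 0
    env.reset()
    while steps < total_timesteps:
        _, _, done = env.step(random.randrange(3))  # random action stands in for the policy
        steps += 1
        if done:
            episodes += 1
            env.reset()
    return steps, episodes

steps, episodes = train_for(1_000, DummyEnv(episode_len=50))
print(steps, episodes)  # 1000 steps span 20 full episodes of length 50
```

So a 1M-timestep run is 1M `env.step` calls, regardless of how episodes are cut up.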

arnaudstiegler commented 1 month ago

Hey, those frames were generated without a trained agent, just by selecting random actions. Looking at the samples, you still see a variety of "events" happening (losing health, shooting a target, etc.), so it's a good base for doing some first passes on the diffusion model. Eventually the agent will have to be trained, but for some "simpler" ViZDoom scenarios, I think the random-action route is not a bad place to start.
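A random-action collection loop of this kind could be sketched as follows. The `StubGame` class and `ACTIONS` list are hypothetical placeholders standing in for `vizdoom.DoomGame` and a real button configuration; only the loop structure (log the current frame and the random action taken from it, restarting episodes as needed) reflects the approach described above:

```python
import random

class StubGame:
    """Hypothetical stand-in for vizdoom.DoomGame with fixed-length episodes."""
    def __init__(self, episode_len=100):
        self.episode_len = episode_len
        self.t = 0

    def new_episode(self):
        self.t = 0

    def get_frame(self):
        return f"frame_{self.t}"  # placeholder for a real screen buffer

    def make_action(self, action):
        self.t += 1

    def is_episode_finished(self):
        return self.t >= self.episode_len

ACTIONS = ["attack", "move_left", "move_right"]  # hypothetical action set

def collect_random_rollout(game, n_steps):
    """Log (frame, action) pairs from a random policy for later diffusion-model training."""
    pairs = []
    game.new_episode()
    for _ in range(n_steps):
        if game.is_episode_finished():
            game.new_episode()
        action = random.choice(ACTIONS)
        pairs.append((game.get_frame(), action))  # frame observed before acting
        game.make_action(action)
    return pairs

data = collect_random_rollout(StubGame(), 250)
print(len(data))  # 250 (frame, action) conditioning pairs
```

Since the policy is random, this can run fully in parallel with RL training and be swapped out for trained-agent rollouts later.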

Re. your second point, I think it's probably better to wait for the agent to be trained before doing the actual data collection. Using a random policy in the meantime lets you parallelize the two, IMO.