hill-a / stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
http://stable-baselines.readthedocs.io/
MIT License
4.16k stars 725 forks source link

How to convert timestep based learning to episodic learning #1175

Open muk465 opened 2 years ago

muk465 commented 2 years ago

Question I am using a custom environment to to path planning using ddpg algorithm , (i am using stable baslines3)

model = DDPG("MlpPolicy", env, action_noise=action_noise, verbose=1) model.learn(total_timesteps=10000, log_interval=1) model.save("sb3_ddpg_model")

here model.learn is used for timestep based learning but i want to convert it to certain like 3000 steps per episode and have multiple episodes,how can i achieve that?

Missourl commented 1 year ago

Hello,

I also have the same problem, but I have found thoses links. Maybe it could help you. https://github.com/hill-a/stable-baselines/issues/352 https://github.com/hill-a/stable-baselines/issues/776

I still didnt not try the proposed solution, because I need to figure out, how could I combine it (proposed solution), with calling my seed in my env. So everytime I start a new episode "i" , the algo pass the "i" in the seed of my env (seed(i)) so I can sample new reproducible values.

Anyway Good luck .