-
Right now, only hyperparameters that are searched by default can have their params dict copied and reused, due to naming issues. This should be extended to hyperparameters that are not searched by de…
-
I tried your script in the MountainCar env, and it seems that the game ends when the episode length reaches 200 steps, but in your TensorBoard plots an episode didn't stop until it reached the final st…
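For reference, a minimal sketch (assuming the classic gym API) showing where the 200-step cap comes from: MountainCar-v0 is registered with a TimeLimit wrapper.
```python
import gym

env = gym.make("MountainCar-v0")
# MountainCar-v0 is registered with max_episode_steps=200, so the
# TimeLimit wrapper ends every episode after at most 200 steps.
print(env.spec.max_episode_steps)  # -> 200
```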
-
I am attempting to access the Atari environments, and despite importing the latest versions of ale-py, autorom, gym, and even gymnasium, I get the following error when attempting to make an environment of a…
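For context, a minimal sketch of one standard way to create an ALE environment with gymnasium and ale-py; the env id here is just an example, and the explicit registration call may only apply to recent versions.
```python
import gymnasium as gym
import ale_py

# On recent gymnasium/ale-py versions the ALE/* ids must be registered
# explicitly; on older versions the `import ale_py` alone suffices.
gym.register_envs(ale_py)

env = gym.make("ALE/Breakout-v5")
obs, info = env.reset()
```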
-
Hi all, I have run DDPG with default hyperparameters in the MuJoCo Swimmer-v2 environment, but the reward converges to a very low value, only 4 or 5, so the swimmer cannot swim at all. I did not change th…
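For reproducibility, a minimal sketch of such a run; the library is not named in the report, so stable-baselines3 is assumed here.
```python
import gym
from stable_baselines3 import DDPG

env = gym.make("Swimmer-v2")
model = DDPG("MlpPolicy", env)          # default hyperparameters throughout
model.learn(total_timesteps=1_000_000)  # reward reportedly plateaus around 4-5
```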
-
Is this not a flawed approach, since you have fundamentally changed the problem by altering the reward? To compare against other solutions, the environment should be the same (including the rewards given).
-
### 🚀 Feature
Hi!
I would like to implement a recurrent soft actor-critic. Is it a sensible contribution?
### Motivation
I actually need this algorithm in my projects.
### Pitch
The sb3 e…
-
This is the figure I'm referring to:
-
### ❓ Question
Consider this setup:
```python
import stable_baselines3
import gym
from stable_baselines3 import DQN, A2C, PPO
# from sb3_contrib import ARS, TRPO
env = gym.make('MountainCar-v0…
```
-
Currently, the way to train an agent is to 1) instantiate the environment, 2) instantiate the agent, passing the environment to the constructor, and 3) call the learn method.
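A minimal sketch of that three-step flow, using stable-baselines3 with an arbitrary algorithm and environment as an example:
```python
import gym
from stable_baselines3 import PPO

env = gym.make("CartPole-v1")        # 1) instantiate the environment
model = PPO("MlpPolicy", env)        # 2) instantiate the agent, passing the env
model.learn(total_timesteps=10_000)  # 3) call the learn method
```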
Some agent frameworks hav…
-
Hey, thanks for providing purejaxrl, it is pretty awesome.
I have used the experimental `S5` code that you provide for part of my research, and after version 0.4.27 (same for 0.4.28) of `jaxlib` I hav…