-
Hello,
Thanks for your great work; I am trying to learn TensorFlow and Keras from your tutorial code. When I used this A2C code to tackle the "MountainCar-v0" problem, I found that the RAM usage…
-
I've implemented the cross-entropy method, and it passes CartPole-v0 and MountainCarContinuous-v0 with the same hyperparameters and the default reward definition from gym. But it didn't work on MountainCar-v0 eve…
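For context, a minimal sketch of the cross-entropy method on a toy objective (using numpy; the function name and hyperparameters here are illustrative, not taken from the reporter's implementation):

```python
import numpy as np

def cross_entropy_method(objective, dim, n_iters=50, pop_size=100, elite_frac=0.2, seed=0):
    """Maximize `objective` by iteratively fitting a diagonal Gaussian to elite samples."""
    rng = np.random.default_rng(seed)
    mean, std = np.zeros(dim), np.ones(dim)
    n_elite = int(pop_size * elite_frac)
    for _ in range(n_iters):
        samples = rng.normal(mean, std, size=(pop_size, dim))
        scores = np.array([objective(s) for s in samples])
        elite = samples[np.argsort(scores)[-n_elite:]]  # keep the top-scoring samples
        mean, std = elite.mean(axis=0), elite.std(axis=0) + 1e-6
    return mean

# Toy objective: maximized at [1.0, 2.0].
best = cross_entropy_method(lambda x: -np.sum((x - np.array([1.0, 2.0])) ** 2), dim=2)
```

On sparse-reward tasks like MountainCar-v0, the elite set carries no signal until some rollout reaches the goal by chance, which is one common reason CEM stalls there while CartPole-v0 works.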
-
**Summary:** I've noticed that the Spinning Up algorithm implementations don't seem to support discrete **observation** spaces defined with [`gym.spaces.Discrete`](https://github.com/openai/gym/blob/m…
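One common workaround (a sketch, not part of Spinning Up itself) is to one-hot encode `Discrete` observations so that code written for `Box` observation spaces can consume them:

```python
import numpy as np

def one_hot(obs, n):
    """Encode a Discrete observation (an int in [0, n)) as a length-n one-hot vector."""
    vec = np.zeros(n, dtype=np.float32)
    vec[obs] = 1.0
    return vec
```

In practice this would be wrapped around the environment (e.g. via an observation wrapper) so the algorithm only ever sees the float vector, e.g. `one_hot(2, 4)` for state 2 of a 4-state space.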
-
I tried your `run_mountain_car.py`, but the accumulated rewards do not change at all.
Are there any hyperparameters that I need to change? And how many episodes are typically needed?
-
I've had a bunch of gym tabs open over the last couple of days, and finding the one I want is really hard because they're all called "OpenAI Gym". It'd be great if pages had more descriptive titles like,…
-
I have been playing with `basic-rs`, replacing `CartPole-v0` with envs from [OpenAI](https://gym.openai.com/envs/). Some work, some do not. The ones that fail seem to fail here:
I had errors with:
…
-
I'd like to implement Hindsight Experience Replay (HER). This can be based on any goal-parameterized off-policy RL algorithm.
**Goal-parameterized architectures**: it requires a variable for…
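As a sketch of the core relabeling step (the function name, dict layout, and "future" strategy shown here are illustrative assumptions, not a prescribed design):

```python
import random

def her_relabel(episode, reward_fn, k=4):
    """Hindsight relabeling ('future' strategy): alongside each original transition,
    store up to k copies whose goal is replaced by an achieved goal from a later
    step of the same episode, with the reward recomputed under the new goal.

    `episode` is a list of dicts carrying at least `achieved_goal` and `goal`;
    `reward_fn(achieved_goal, goal)` is the sparse goal-conditioned reward.
    """
    relabeled = []
    for t, tr in enumerate(episode):
        relabeled.append({**tr, "reward": reward_fn(tr["achieved_goal"], tr["goal"])})
        future = episode[t:]
        for _ in range(min(k, len(future))):
            new_goal = random.choice(future)["achieved_goal"]
            relabeled.append({**tr, "goal": new_goal,
                              "reward": reward_fn(tr["achieved_goal"], new_goal)})
    return relabeled

# Tiny example: 3 transitions whose achieved goals are 0..2; the original goal (99)
# is never reached, so every original transition gets the failure reward.
episode = [{"achieved_goal": t, "goal": 99} for t in range(3)]
sparse = lambda ag, g: 0.0 if ag == g else -1.0
out = her_relabel(episode, sparse, k=4)
```

The relabeled transitions then go into the replay buffer of whichever off-policy algorithm (DDPG, SAC, DQN, …) is used underneath, with the goal concatenated to the observation as the policy/value input.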
-
When the rllab TRPO code is applied to the MountainCar env, it does not climb the mountain well even after 500 iterations.
This is strange, since the TRPO algorithm implemented by OpenAI (https://github.…
-
* Weights and Biases version: 0.6.32
* Python version: 2.7
* Operating System: OSX
### Description
An error occurs because of how Unicode is handled in Python 2. I had a Unicode character in the descrip…
-
Checklist:
- [x] Parallelized training for experts (Adam finished this via #57).
- [x] Get good experts for Humanoid and Ant [which are not doing well right now](https://github.com/HumanCompatib…