-
### What is the problem?
I am using BOHB to optimize the hyperparameters of the DQN algorithm in order to solve the MountainCar-v0 problem.
I always run into the following issue (even if I use…
-
To benchmark AIRL, I'm planning to compare the imitation performance of our new AIRL implementation on modern gym envs against the performance of the old AIRL implementation (on adamgleve/inverse_rl) …
-
**Describe the bug**
Hyperparameter optimization breaks the TensorBoard logging. When it is active and multiple optimization jobs are running, all datapoints are logged to the last job's TensorBoard.…
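A common workaround, sketched here without reference to this codebase's actual API, is to give every optimization job its own writer pointing at a unique log directory; `job_log_dir` below is a hypothetical helper:

```python
import os

def job_log_dir(base_dir, job_id):
    """Build a unique TensorBoard log directory per optimization job.

    Hypothetical helper: each job passes its own directory to its own
    SummaryWriter instead of all jobs sharing one global writer.
    """
    return os.path.join(base_dir, f"job_{job_id}")

# Each job then creates its own writer, e.g.:
# writer = SummaryWriter(log_dir=job_log_dir("runs", 3))
```

With one writer per directory, each job's datapoints stay in its own run instead of being appended to the last job's event file.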
-
Currently in the MountainCar-v0 environment, the [timestep_limit is 200](https://github.com/openai/gym/blame/master/gym/envs/__init__.py#L70), which makes learning very difficult: most initial policies…
-
Hi, what exciting work you have done in tianshou! But I still have some doubts while using the code in my experiments. I have found that there is no difference between the continuous-version PPO and the discre…
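The truncated question appears to concern continuous- versus discrete-action PPO. In many implementations the PPO loss itself is identical in both cases; only the action distribution changes (Categorical over logits for discrete actions, Gaussian for continuous ones). A minimal stdlib sketch of that distinction (the function names are illustrative, not tianshou's API):

```python
import math

def categorical_log_prob(logits, action):
    """Log-probability of a discrete action under softmax(logits)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    return logits[action] - log_z

def gaussian_log_prob(mean, std, action):
    """Log-probability of a continuous action under N(mean, std^2)."""
    return (-0.5 * ((action - mean) / std) ** 2
            - math.log(std) - 0.5 * math.log(2 * math.pi))

def ppo_ratio(new_log_prob, old_log_prob):
    """PPO's importance ratio uses the log-probs identically in both cases."""
    return math.exp(new_log_prob - old_log_prob)
```

tianshou, for instance, parameterizes its PPO policy with a `dist_fn`, so switching between the discrete and continuous cases is largely a matter of which distribution is passed in.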
-
**Describe the bug**
When we use reward normalization, it is expected that evaluations are done with the original reward values. This is actually done for training (train.py: lines 291-298), but evalua…
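One common pattern (a sketch under generic assumptions, not this repo's actual train.py code) is to return the original reward alongside the normalized one, so that evaluation can always log unnormalized returns:

```python
class RewardNormalizer:
    """Minimal sketch: normalize rewards for training while keeping the
    original value available for evaluation logging."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0  # sum of squared deviations (Welford's algorithm)

    def update(self, reward):
        self.count += 1
        delta = reward - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (reward - self.mean)

    def normalize(self, reward):
        """Return (normalized_reward, original_reward)."""
        self.update(reward)
        std = (self.m2 / self.count) ** 0.5 if self.count > 1 else 1.0
        return (reward - self.mean) / (std + 1e-8), reward
```

Training consumes the first element of the tuple; evaluation reports the second, so normalized values never leak into evaluation metrics.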
-
The wiki for MountainCar-v0 says that the episode ends when you reach position 0.5, or once 200 iterations are reached. But I didn't find any check on the number of iterations in the code.…
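The check is not in the environment file itself: Gym enforces the limit through the `TimeLimit` wrapper that `gym.make` applies using the registry's `max_episode_steps`. A minimal pure-Python sketch of how such a wrapper works (the class names here are illustrative, not Gym's source):

```python
class ToyEnv:
    """Stand-in env that never terminates on its own (like a policy that
    never reaches position 0.5)."""
    def reset(self):
        self.t = 0
        return 0
    def step(self, action):
        self.t += 1
        return 0, -1.0, False, {}  # obs, reward, done, info

class TimeLimit:
    """Sketch of a time-limit wrapper: it forces done=True after
    max_episode_steps, which is why no iteration check appears in the
    environment's own step()."""
    def __init__(self, env, max_episode_steps=200):
        self.env = env
        self.max_episode_steps = max_episode_steps
    def reset(self):
        self.elapsed = 0
        return self.env.reset()
    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self.elapsed += 1
        if self.elapsed >= self.max_episode_steps:
            done = True
            info["TimeLimit.truncated"] = True
        return obs, reward, done, info
```

So the env returned by `gym.make("MountainCar-v0")` is already wrapped, and the 200-step cutoff happens in the wrapper, not in the MountainCar code one finds by reading the environment file.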
-
Thanks for your development; this seems to be an inspiring project!
However, when I tried to launch a command from the Examples:
`python run.py --gym -a ppo -n train_using_gym --gym-env MountainCar-v0 --rend…
-
**Improving description for `argparse.ArgumentParser` in `stable_baselines\deepq\experiments\train_mountaincar.py`**
Line 34 in the file reads:
```python
parser = argparse.ArgumentParser(de…
-
Hello,
I ran into a problem when I tried to use DDPG + HER.
The problem seems to be in the definition of observation_space.
```bash
Traceback (most recent call last):
File "/home/all-jy/git/jy_gym_stfl/tr…