-
Hi, thanks for your implementation of TRPO.
In https://github.com/wojzaremba/trpo/blob/master/main.py#L128-L132 you normalize the advantage function.
I couldn't find any description about th…
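For context, a minimal sketch of the kind of per-batch advantage normalization those lines perform (the function name and the `eps` constant are my own, not from the repo):

```python
import numpy as np

def normalize_advantages(advantages, eps=1e-8):
    # Standardize the batch of advantage estimates to zero mean and
    # unit standard deviation; eps guards against division by zero.
    advantages = np.asarray(advantages, dtype=np.float64)
    return (advantages - advantages.mean()) / (advantages.std() + eps)
```

Normalizing this way makes the scale of the surrogate objective independent of the reward magnitude, which tends to stabilize the step-size computation.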
-
https://github.com/ikostrikov/pytorch-trpo/blob/e200eb8a23b3c7941a0091efb9750dafa4b23cbb/main.py#L108-L119
The fixed log-prob part of the line and the `get_loss` function part are exactly the sam…
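If the question is why the two computations look identical: the usual pattern is that the "fixed" log-probs are a detached snapshot taken once before the update, while `get_loss` recomputes log-probs from the live policy. A minimal sketch of that pattern (the `policy.get_log_prob` helper is hypothetical, not the repo's actual API):

```python
import torch

def get_loss(policy, states, actions, advantages, fixed_log_probs):
    # fixed_log_probs was computed from the same policy and then detached,
    # so at the first evaluation the ratio is exp(0) = 1 and the loss equals
    # -mean(advantages); its gradient w.r.t. the live policy parameters is
    # still nonzero, which is what the TRPO line search needs.
    log_probs = policy.get_log_prob(states, actions)
    ratio = torch.exp(log_probs - fixed_log_probs.detach())
    return -(ratio * advantages).mean()
```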
-
I'm just curious if there is any effort to add these policy gradient methods?
-
I successfully installed rllab, and the trpo_cartpole.py example worked fine, but I'm getting an error with trpo_gym.py:
```
Traceback (most recent call last):
  File "trpo_gym.py", line 6, in <module>
…
```
-
Hi, I am running the code without changes.
When running `time python3 -m baselines_energyplus.trpo_mpi.run_energyplus --num-timesteps 1000000000`, the program stops at iteration 31.
The problem is abo…
-
Apparently, classic control environments in Gym have a different key for render modes in `env.metadata`.
In fact:
```python
>>> import gym
>>> env = gym.make('CartPole-v1')
>>> env.metadata
{…
```
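A minimal sketch of a lookup that tolerates both key spellings (assuming `render_modes` and `render.modes` are the two variants in play, as the metadata difference above suggests):

```python
import gym

env = gym.make('CartPole-v1')
# Newer Gym releases expose 'render_modes'; older ones (and some classic
# control environments) use 'render.modes'. Fall back from one to the other.
modes = env.metadata.get('render_modes',
                         env.metadata.get('render.modes', []))
print(modes)
```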
-
https://github.com/openai/baselines/blob/f2729693253c0ef4d4086231d36e0a4307ec1cb3/baselines/gail/dataset/mujoco_dset.py#L53
When I run run_mujoco.py, the code merged in #447 is currently erroring o…
-
Hi, I shifted from TRPO to PPO2. I know how to train a model, save it, and then re-train the saved model in TRPO, but in PPO2 I don't know the code for it.
@antoine-galataud you had on…
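For what it's worth, a minimal sketch assuming the openai/baselines PPO2 API, where `ppo2.learn` accepts a `load_path` argument and the returned model exposes `save` (if you are on a different fork, the calls may differ):

```python
import gym
from baselines.ppo2 import ppo2
from baselines.common.vec_env.dummy_vec_env import DummyVecEnv

env = DummyVecEnv([lambda: gym.make('CartPole-v0')])

# First run: train, then save the weights.
model = ppo2.learn(network='mlp', env=env, total_timesteps=100000)
model.save('ppo2_cartpole')

# Later run: resume training from the saved weights.
model = ppo2.learn(network='mlp', env=env, total_timesteps=100000,
                   load_path='ppo2_cartpole')
```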
-
I've tried to use TRPO to create a model for `CartPole-v0` by following the instructions on your [OpenAI Gym page](https://gym.openai.com/evaluations/eval_4QXCRAATTDqakJV0YZlJ4g#reproducibility), chan…
-
Hello,
I'm sorry for asking so many questions, but would you know how to go about saving and loading the subclassed model? I have tried everything I can think of, including trying to change the subc…
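In case it helps, the usual workaround for subclassed tf.keras models (which often cannot be serialized whole to HDF5) is to save only the weights and rebuild the class before loading. A minimal sketch with a hypothetical stand-in model:

```python
import tensorflow as tf

class MyModel(tf.keras.Model):
    # Hypothetical stand-in for the subclassed model in question.
    def __init__(self):
        super().__init__()
        self.dense = tf.keras.layers.Dense(2)

    def call(self, inputs):
        return self.dense(inputs)

model = MyModel()
model(tf.zeros((1, 4)))              # run once so the variables are created
model.save_weights('my_model_ckpt')  # TensorFlow checkpoint format

restored = MyModel()
restored(tf.zeros((1, 4)))           # build with the same input shape
restored.load_weights('my_model_ckpt')
```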