-
```
Traceback (most recent call last):
  File "TRPO.py", line 169, in
    action = agent.act_and_train(obs, reward)
  File "C:\Anaconda3\envs\osim-rl2\lib\site-packages\chainerrl\agents\trpo.py", line 680, in …
```
-
The TRPO and PPO implementations are general enough to be in their own solver package in the POMDPs.jl ecosystem. I've already encapsulated these solvers into the DeepRL module.
Some TODOs:
- [ ] …
-
rllab works fine on the CentOS server, but it does not work on my Mac. When I ran trpo_cartpole.py in examples, it got stuck here and made no progress:
python trpo_cartpole.py
/Users/lchenat/anaconda2/li…
-
`python example/trpo_swimmer.py` works well. With the default settings it produces an average reward of 55.72 after 40 iterations.
When I try to run trpo_swimmer.py in "stub" mode (I simply add "…
-
Hello everybody!
I'm trying to train a 6-DOF robotic arm (UR5) to reach a 3D goal in its reachable space with DDPG, TRPO, etc.
I've created my own MuJoCo asset and Gym environment to be launched in …
-
Hi,
I am getting an error when running the examples:
```
Traceback (most recent call last):
  File "rllab/examples/trpo_cartpole.py", line 1, in
    from rllab.algos.trpo import TRPO
  File "/…
```
-
Why is the update subtracted in the TRPO tf1 implementation? It seems to be the opposite of what is said in the paper and the SpinningUp docs.
```
def set_and_eval(step):
    sess.run(set_pi_params, feed_dict={v_ph: old_params - alpha …
```
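For what it's worth, the sign is most likely just a convention: if `pi_loss` is defined as the negative of the surrogate objective (something to be minimized) and `g`/`x` are computed from that loss, then subtracting `alpha * x` descends the loss, which is exactly ascending the surrogate objective from the paper. Below is a minimal self-contained sketch of that convention; the quadratic stand-in for the surrogate, `A`, `b`, and `alpha` are made up for illustration and are not from the SpinningUp code.

```python
import numpy as np

# Hypothetical quadratic "surrogate objective" J(theta) = b.theta - 0.5 theta'A theta,
# which we want to MAXIMIZE (A plays the role of the curvature / Fisher matrix H).
A = np.array([[2.0, 0.3],
              [0.3, 1.0]])
b = np.array([1.0, -0.5])

def surrogate(theta):
    return b @ theta - 0.5 * theta @ A @ theta

theta_old = np.zeros(2)

# "pi_loss" convention: loss = -J(theta), to be MINIMIZED.
grad_loss = -(b - A @ theta_old)     # gradient of the loss at theta_old
x = np.linalg.solve(A, grad_loss)    # search direction solving H x = g
alpha = 0.5

# The step is SUBTRACTED because we are descending the (negated) loss ...
theta_new = theta_old - alpha * x

# ... yet the surrogate objective from the paper still goes up.
assert surrogate(theta_new) > surrogate(theta_old)
print(surrogate(theta_old), surrogate(theta_new))
```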
-
Implement the main agent with Trust Region Policy Optimization (TRPO, see [Link](https://arxiv.org/abs/1502.05477))
- [x] Set up InvertedPendulum environment in OpenAI Gym
- [x] Set up neural net an…
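For reference, a minimal sketch of the first two checklist items (environment setup plus a small Gaussian policy network), assuming classic gym with mujoco-py; the env id, hidden sizes, and the `GaussianPolicy` class here are illustrative choices, not the project's actual code:

```python
import gym
import torch
import torch.nn as nn

# Env id may be "InvertedPendulum-v4" on newer gym/gymnasium builds.
env = gym.make("InvertedPendulum-v2")
obs_dim = env.observation_space.shape[0]
act_dim = env.action_space.shape[0]

class GaussianPolicy(nn.Module):
    """Diagonal-Gaussian policy: an MLP outputs the mean, log-std is a free parameter."""
    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.mu_net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, hidden), nn.Tanh(),
            nn.Linear(hidden, act_dim),
        )
        self.log_std = nn.Parameter(torch.zeros(act_dim))

    def distribution(self, obs):
        return torch.distributions.Normal(self.mu_net(obs), self.log_std.exp())

policy = GaussianPolicy(obs_dim, act_dim)

# Single interaction step (classic gym API: reset() -> obs, step() -> 4-tuple).
obs = env.reset()
dist = policy.distribution(torch.as_tensor(obs, dtype=torch.float32))
action = dist.sample()
obs, reward, done, info = env.step(action.numpy())
```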
-
Using TensorFlow TRPO for the OpenAI Gym MountainCar-v0 environment doesn't converge every run. Some runs converge to a good policy; others stay at -200 reward forever.
Gist of code atte…
-
I am running the trpo_mpi code on the version1 branch. When I run the experiment, it waits on a random port: "Waiting for server on 33791..."
It shows a different port for the multi-threaded run. I tri…