-
On p. 175, PPO is presented as off-policy and its algorithm is derived via the importance-sampling technique used in off-policy learning, but OpenAI's documentation describes PPO as on-policy and derives it as a first-order approximation to TRPO. (For details see: https://spinningup.openai.com/en/latest/algorithms/ppo.html)
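A short note that may reconcile the two views (my own sketch, not from the book): PPO's clipped surrogate objective does contain an importance-sampling ratio, but only between the current policy \(\pi_\theta\) and the recent policy \(\pi_{\theta_{\text{old}}}\) that collected the data, which is why Spinning Up still classifies it as on-policy:

```latex
r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_{\text{old}}}(a_t \mid s_t)}, \qquad
L^{\text{CLIP}}(\theta) =
\hat{\mathbb{E}}_t\!\left[\min\!\bigl(r_t(\theta)\,\hat{A}_t,\;
\operatorname{clip}\!\bigl(r_t(\theta),\,1-\epsilon,\,1+\epsilon\bigr)\,\hat{A}_t\bigr)\right]
```

Here \(\hat{A}_t\) is the advantage estimate and \(\epsilon\) the clip range; the ratio comes from the importance-sampling derivation, while the clipping replaces TRPO's explicit trust-region constraint.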
-
Hi Vince, many thanks for your fantastic work! I would like to know if there is a plan to support the TRPO algorithm. Thanks a lot!
-
Hi all,
I want to apply multi-agent reinforcement learning, specifically the PPO, TRPO, DDPG, and A2C algorithms. I don't understand how to write a Carla environment for these algorithms. Is any …
-
Hi,
I am wondering whether chainerrl supports running TRPO on Atari. I tried to do so by following the code for training PPO on Atari, but I am faced with the following error:
> Traceback (most rec…
-
Training is very slow; we need to check the hyperparameters against the original paper.
-
When I run wgail.py, it seems the trpo module is missing. Can you provide some details about running this file? Thanks very much!
-
I want to replace TRPO with DDPG + HER and am having difficulties. The combination only works with a task that is registered with Gym. How did TRPO avoid that requirement?
-
Hello, thank you for sharing your code.
May I ask a question about the paper? Since PPO is an upgrade of TRPO, have you considered using PPO instead of TRPO? I am facing this question in my thesis. I wonder…
-
Damn it, create a new project in Visual Studio in the same solution. Name it TRPO.Database. Move the Database class from the TRPO.Services project into it.
-
On executing trpo_continous.py, I get the following error:
> [2017-07-01 23:52:58,375] Making new env: CartPole-v0
> [TL] InputLayer continous_shared/continous_input_layer: (?, 3)
> [TL…