trpo Search Results - Githubissues

782 results for trpo

openai/baselines #263

Simple way to run trained policy with PPO1/PPO2/TRPO?

First of all, thank you for providing these great baselines! I can train the policies for the various algorithms (PPO1/PPO2/TRPO) and see that average reward increases and loss decreases, but is th…

mfe7 updated 5 years ago
3
Khrylx/PyTorch-RL #4

Training a recurrent policy

I am still struggling with the implementation of a recurrent policy. The trick from [#1](https://github.com/Khrylx/PyTorch-RL/issues/1) worked and I can now start running my RNN GAIL Network. But no m…

erschmidt updated 5 years ago
4
StepNeverStop/RLs #34

Check that the code implementation is accurate and reasonabl…

- [x] check and fix C51 [deaab73] - [x] check qrdqn [deaab73] - [ ] check iqn - [ ] check and fix Rainbow - [ ] check on-policy buffer sampling - [ ] check function `discounted_sum` - [ ] check …

StepNeverStop updated 3 years ago
2
rll/rllab #188

AttributeError: 'LSTMNetwork' object has no attribute 'step_…

I am trying to run a simple TensorFlow example on CartPole with the LSTMNetwork. It appears a critical member is missing from the class. I also get the same error when using the GRUNetwork. > Error…

brett-daley updated 6 years ago
4
807-Girl-Keeper/Task-improments #1

2020-07-31

1. Finished the RL material, although TRPO and PPO still confuse me. 2. Finished section 4 of Andrew Ng's course. 3. Did not study anything else.

Iven-Wu updated 3 years ago
2
Khrylx/PyTorch-RL #26

Various questions?

Hi, Thanks a lot for this extremely useful implementation. I wanted just to ask what is the ZFilter class, is it used to standardize the observed state according to the running mean and std of t…

lviano updated 2 years ago
1
rll/rllab #169

Could not open "params.pkl" after running trpo_cartpole_pick…

Hi, I have just installed the rllab environment and ran the example code trpo_cartpole_pickled.py successfully. It produced the log files "debug.log params.pkl progress.csv variant.json". And when I am …

Gin8787 updated 7 years ago
1
tristandeleu/pytorch-maml-rl #66

The progress bar doesn't increase at all

After `pip install -r requirements.txt`, I ran `python train.py --config configs/maml/halfcheetah-vel.yaml --output-folder maml-halfcheetah-vel --seed 1 --num-workers 8` but progress doesn't i…

seolhokim updated 12 months ago
4
StepNeverStop/RLs #41

Implement new reinforcement learning algorithms

- MARL: - [x] MADDPG - [x] MASAC 1346949 - [x] IQL - [x] VDN - [x] Q-MIX - [x] Qatten ad8be31 - [ ] MAPPO - [ ] COMA - [ ] QTRAN-alt - [x] QTRAN-base 4c45ba0 - [x] QPL…

StepNeverStop updated 3 years ago
1
openai/baselines #584

How to generate deterministic.ppo2...npz and stochastic.ppo…

I want to use ppo2 or trpo to sample a random policy and then use GAIL for imitation learning. Can you share some ideas with me? I would greatly appreciate your help.

huangjiancong1 updated 6 years ago
2
