-
Hi,
We developed and tested our algorithm OT-TRPO (to appear at NeurIPS 2022; you can find the preprint [here](https://arxiv.org/abs/2210.11137)) using Stable Baselines.
Is there an …
-
...
```sh
python3 logging_bio_args.py --total_timesteps=10000000.0 --SEED=4 --timesteps_per_batch=4000 --algo=TRPO
python3 logging_bio_args.py --total_timesteps=10000000.0 --SEED=4 --timesteps_per_batch=8…
```
-
Hi,
Is there any simple way in rllab to save the learned policy?
For example, using TRPO, I want to save the policy once it has been learned, so that I can simply look into the trajectory/path of …
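In case it helps: rllab components are built on its Serializable base, so the trained policy object can usually just be pickled and reloaded. A minimal sketch, assuming a Box2D cart-pole setup (file names and iteration counts here are arbitrary):

```python
import pickle

from rllab.algos.trpo import TRPO
from rllab.baselines.linear_feature_baseline import LinearFeatureBaseline
from rllab.envs.box2d.cartpole_env import CartpoleEnv
from rllab.envs.normalized_env import normalize
from rllab.policies.gaussian_mlp_policy import GaussianMLPPolicy

env = normalize(CartpoleEnv())
policy = GaussianMLPPolicy(env_spec=env.spec)
algo = TRPO(env=env, policy=policy,
            baseline=LinearFeatureBaseline(env_spec=env.spec),
            n_itr=40)
algo.train()

# Persist the trained policy object.
with open("policy.pkl", "wb") as f:
    pickle.dump(policy, f)

# Later: reload it and roll out one trajectory to inspect the path.
with open("policy.pkl", "rb") as f:
    policy = pickle.load(f)
obs = env.reset()
path = [obs]
for _ in range(100):
    action, _ = policy.get_action(obs)
    obs, reward, done, _ = env.step(action)
    path.append(obs)
    if done:
        break
```

Running through run_experiment_lite with a snapshot_mode set also makes rllab write per-iteration .pkl snapshots on its own, which may already be enough.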
-
While running TRPO training, after some time (random, anywhere from 15 sec to 1 min) it crashes with the following:
```
Traceback (most recent call last):
  File "callback.py", line 196, in <module>
    model.lea…
```
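For context, a minimal self-contained sketch of the call pattern the traceback points at (a script passing a custom callback into model.learn); this assumes the sb3-contrib TRPO implementation, and ProgressCallback here is hypothetical, not the script's actual callback:

```python
from sb3_contrib import TRPO
from stable_baselines3.common.callbacks import BaseCallback

class ProgressCallback(BaseCallback):
    """Hypothetical callback: report progress every 1000 steps."""
    def _on_step(self) -> bool:
        if self.n_calls % 1000 == 0:
            print(f"{self.num_timesteps} timesteps so far")
        return True  # returning False here aborts training

model = TRPO("MlpPolicy", "Pendulum-v1", verbose=0)
model.learn(total_timesteps=10_000, callback=ProgressCallback())
```

If a stripped-down setup like this still dies at a random point, the fault is more likely in the environment or the callback body than in TRPO itself.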
-
Thanks to the OpenAI team for the latest release!
Are there any benchmark results (like Atari scores) for PPO and TRPO? DQN has a report here: https://github.com/openai/baselines-results. It's super…
-
https://github.com/whoisthisadam/trpo-practice/blob/6c0fbddbf4d11896c28e7a22223c800c3ad18a22/trpo%20pz5.cpp#L127
to_string() cannot be used here; you need to write your own conversion function instead.
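For illustration, a sketch of the kind of hand-rolled conversion the comment asks for (the function name is hypothetical):

```cpp
#include <string>

// Manual int-to-string conversion, replacing std::to_string().
std::string int_to_string(int value) {
    if (value == 0) return "0";
    bool negative = value < 0;
    // Widen before negating so INT_MIN does not overflow.
    long long v = value;
    if (negative) v = -v;
    std::string digits;
    while (v > 0) {
        digits.push_back(static_cast<char>('0' + v % 10));
        v /= 10;
    }
    if (negative) digits.push_back('-');
    // Digits were collected least-significant first; reverse them.
    return std::string(digits.rbegin(), digits.rend());
}
```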
-
This will be an especially interesting task, since I believe it was originally made for OpenAI Gym sessions, and I do not think we should try to cobble together a data structure to spoof a Gym.
We…
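For a sense of what spoofing would entail: even a minimal stand-in has to implement the full classic (pre-Gymnasium) gym.Env surface. A hypothetical stub for illustration:

```python
import gym
import numpy as np
from gym import spaces

class StubEnv(gym.Env):
    """Hypothetical stand-in showing the interface a Gym session expects."""

    def __init__(self):
        self.observation_space = spaces.Box(-1.0, 1.0, shape=(4,), dtype=np.float32)
        self.action_space = spaces.Discrete(2)
        self._t = 0

    def reset(self):
        self._t = 0
        return self.observation_space.sample()

    def step(self, action):
        self._t += 1
        obs = self.observation_space.sample()
        done = self._t >= 10  # classic API: (obs, reward, done, info)
        return obs, 0.0, done, {}
```

Anything less than this (consistent spaces, the reset/step contract) tends to break Gym-based tooling in opaque ways, which is the argument against cobbling one together.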
-
```
Traceback (most recent call last):
  File "TRPO.py", line 169, in <module>
    action = agent.act_and_train(obs, reward)
  File "C:\Anaconda3\envs\osim-rl2\lib\site-packages\chainerrl\agents\trpo.py", line 680, in …
```