-
The following line evaluates to zero, which then throws a `ValueError`, when the batch size resulting from the number of processes and steps is smaller than the number of mini-batches. This happens to me especially i…
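For reference, a minimal sketch of the failure mode (the variable names below are assumptions, not the repo's exact code): PPO splits the `num_processes * num_steps` collected transitions into `num_mini_batch` mini-batches, and an integer division that lands on zero is rejected by `BatchSampler`.

```python
# Hypothetical numbers that reproduce the arithmetic:
num_processes = 1
num_steps = 5
num_mini_batch = 32

mini_batch_size = num_processes * num_steps // num_mini_batch
print(mini_batch_size)  # -> 0; torch.utils.data.BatchSampler raises ValueError for batch_size=0

# A guard like this would surface the constraint before training starts:
assert num_processes * num_steps >= num_mini_batch, (
    "num_processes * num_steps ({}) must be >= num_mini_batch ({})".format(
        num_processes * num_steps, num_mini_batch))
```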
-
First, great work!!
I've decided to "upgrade" and use your A2C implementation instead of your A3C's, but I was surprised to see in your code that the changes aren't as minor as I thought they would be. …
-
Although the current implementation of the policy takes a `deterministic` argument, it is never applied, and all policies sample random actions even for testing.
https://github.com/ikostrikov/pytorch-a2c…
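For what it's worth, here is a rough sketch (not the repo's actual `Policy` class; all names are made up) of how a `deterministic` flag is usually honored at action-selection time, by taking the distribution's mode instead of sampling:

```python
import torch
import torch.nn as nn
from torch.distributions import Categorical

class TinyPolicy(nn.Module):
    def __init__(self, num_inputs, num_actions):
        super().__init__()
        self.logits = nn.Linear(num_inputs, num_actions)

    def act(self, obs, deterministic=False):
        dist = Categorical(logits=self.logits(obs))
        if deterministic:
            # greedy action for evaluation / testing
            return dist.probs.argmax(dim=-1)
        # stochastic action for exploration during training
        return dist.sample()

policy = TinyPolicy(4, 2)
print(policy.act(torch.zeros(1, 4), deterministic=True))
```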
-
I get the following error:
```
Traceback (most recent call last):
File "acktr-agent.py", line 10, in
from baselines.common.vec_env.subproc_vec_env import SubprocVecEnv
File "/Users/cli…
-
It seems like baselines is not directly implemented to deal with `Box()` type action spaces. This exact same code works for the `CartPole` environment but fails on `FetchReach-v1`. Here is the code…
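In case it helps, a rough sketch (the helper names here are made up, not part of baselines or this repo) of how an agent can branch on the action-space type, so that `Box` spaces get a Gaussian head instead of the `Categorical` head used for `Discrete` spaces:

```python
import gym
import torch
import torch.nn as nn
from torch.distributions import Categorical, Normal

def make_action_head(action_space, hidden_size=64):
    # One output per discrete action (logits) or per continuous dimension (means).
    if isinstance(action_space, gym.spaces.Discrete):
        return nn.Linear(hidden_size, action_space.n)
    elif isinstance(action_space, gym.spaces.Box):
        return nn.Linear(hidden_size, action_space.shape[0])
    raise NotImplementedError(type(action_space))

def sample_action(head_out, action_space, log_std=None):
    if isinstance(action_space, gym.spaces.Discrete):
        return Categorical(logits=head_out).sample()
    # For Box spaces, treat the head output as the mean of a diagonal Gaussian.
    std = torch.exp(log_std) if log_std is not None else torch.ones_like(head_out)
    return Normal(head_out, std).sample()
```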
-
https://github.com/ikostrikov/pytorch-a2c-ppo-acktr/blob/17ea8333ecbfe6552470f50fab4f83e1444f43a6/main.py#L226
-
It seems the PyTorch MNIST hogwild example has been updated, as gradients are now allocated lazily.
I think this means that this part of your code is no longer required?
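For context, the part being discussed looks roughly like the common A3C `ensure_shared_grads` helper (a sketch, not necessarily a verbatim copy of this repo's code), which only matters while `shared_param.grad` starts out as `None`:

```python
def ensure_shared_grads(model, shared_model):
    # Point the shared model's .grad tensors at the worker's gradients,
    # but only the first time, while shared_param.grad is still None.
    for param, shared_param in zip(model.parameters(), shared_model.parameters()):
        if shared_param.grad is not None:
            return
        shared_param._grad = param.grad
```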
-
I have two questions regarding the implementation of recurrent policies:
1. Why do you have a loop recomputing states in your recurrent policy? It seems you could use the states you already stored …
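To make the question concrete, here is a toy sketch (shapes and names are assumptions) of what the two options look like; if the stored states are detached from the graph, reusing them would cut the gradient path through the recurrent connections, whereas re-running the GRU step by step inside the update keeps every hidden state in the current computation graph.

```python
import torch
import torch.nn as nn

gru = nn.GRUCell(8, 16)
obs_seq = torch.randn(5, 1, 8)      # 5 timesteps, batch of 1
stored_h0 = torch.zeros(1, 16)      # hidden state saved during the rollout (detached)

# (a) recompute in a loop: gradients flow back through every recurrent step
h = stored_h0
recomputed = []
for t in range(obs_seq.size(0)):
    h = gru(obs_seq[t], h)
    recomputed.append(h)

# (b) reuse stored states: each step would be conditioned on a constant tensor,
# so the recurrent weights would only see gradients from single-step transitions
```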
-
```
I00000009 0x71a94ad414ff12c2 rcv loss_detection_alarm=381524903823256 last_hs_tx_pkt_ts=381524893823256 alarm_duration=10
I00000009 0x71a94ad414ff12c2 frm 2992096611 tx S01(0x1f) STREAM(0x16) id…
```
-
Like, e.g., imagine I have my own policy that takes in a state and outputs an action, or perhaps a distribution over actions; and I have a world that takes an action and returns a reward and a new sta…
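A minimal sketch of one way such a policy/world pair could plug in (everything here is hypothetical: `world`, its `reset`/`step` methods, and the space sizes): wrap the world as a `gym.Env`, which is the environment interface the training code expects.

```python
import gym
import numpy as np

class MyWorldEnv(gym.Env):
    """Adapts a world with reset() -> state and step(action) -> (reward, new state)."""

    def __init__(self, world):
        self.world = world
        # Hypothetical spaces; set these to match the real world and policy.
        self.observation_space = gym.spaces.Box(-np.inf, np.inf, shape=(4,), dtype=np.float32)
        self.action_space = gym.spaces.Discrete(2)

    def reset(self):
        self.state = self.world.reset()
        return self.state

    def step(self, action):
        reward, self.state = self.world.step(action)
        done = False  # this sketch never terminates; add a real condition as needed
        return self.state, reward, done, {}
```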