ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

MIT License

3.52k stars 831 forks

issues

Updates: Support the latest Atari environments and state entropy maximization-based exploration

#298 yuanmingqi opened 2 years ago
0
Update: Support the latest Atari environments.

#297 yuanmingqi closed 2 years ago
0
Updates: Support the latest Atari environment and state entropy maximization-based exploration.

#296 yuanmingqi opened 2 years ago
0
Yml support

#295 DexiongYung closed 2 years ago
0
Why didn't run to generate log?

#294 Can-no opened 2 years ago
0
Why is episode_rewards negative when running ant_v3 with PPO?

#293 Can-no opened 2 years ago
0
Where does the expert data for GAIL come from?

#292 YY-GX opened 2 years ago
0
setup.py and requirements.py have same dependencies except for h5py

#291 andyk opened 2 years ago
0
Oops! wrong repo :-D

#290 andyk closed 2 years ago
1
question about the recurrent

#289 rainbow979 closed 2 years ago
1
[Question] Can I use Recurrent_policy for GAIL in this implementation?

#288 LongchaoDa opened 2 years ago
0
fix value loss coefficient

#287 dmitrySorokin opened 2 years ago
0
add pybullet installation

#286 lijiyao919 opened 2 years ago
0
add h5py in requirement.txt and add conda install atari in readme

#285 lijiyao919 closed 2 years ago
0
why PPO needs to store action_log_probs instead of using stop_gradient for better efficiency?

#284 Emerald01 opened 2 years ago
1
object has no attribute 'steps' in acktr

#283 sungreong opened 2 years ago
0
No softmax before categorical loss?

#282 nirweingarten opened 2 years ago
0
Operations that have no effect

#281 ArashVahabpour opened 2 years ago
0
CNN Architecture

#280 araffin opened 3 years ago
0
Possible bug on the sign of policy log prob. in Fisher computation

#279 daniloefl opened 3 years ago
0
Stale hidden states

#278 aklein1995 opened 3 years ago
0
Can not run enjoy.py

#277 juanjuan2 opened 3 years ago
0
Can I train in my own game

#276 hhhcwb38712 opened 3 years ago
0
Why acktr algorithm cannot be used in Mujoco settings?

#275 ChenDRAG opened 3 years ago
0
observation reset before insert

#274 seed851218 opened 3 years ago
0
In FixedNormal method, 'entrop' is typed wrong

#273 xkianteb closed 3 years ago
0
does mask introduce bias in the gail implementation ?

#272 HareshKarnan opened 3 years ago
0
Error after python main.py --env-name "PongNoFrameskip-v4"

#271 FulChou opened 3 years ago
3
change self.clipob to self.clip_obs

#270 HareshKarnan closed 3 years ago
0
Combine Acktr model with grad-cam

#268 seed851218 opened 3 years ago
2
New parallel PyTorchRL library based on this one

#267 giadefa opened 3 years ago
0
Suggestion - implement some "tricks" that improve performance

#266 henrycharlesworth opened 3 years ago
1
ob_rms_to_obs_rms

#265 hotco87 closed 3 years ago
0
I converted your implementation to tensorflow but it does not work

#264 ghost closed 3 years ago
0
enjoy.py fails. Unexpected argument 'ret'

#263 jakefoster954 closed 3 years ago
2
Unable to run enjoy.py

#262 jakefoster954 opened 3 years ago
1
Can't access the trained model files.

#261 TigerVersusT opened 3 years ago
3
PPO does not converge for Pendulum-v0

#260 ZhizhenQin opened 3 years ago
0
Is setting the flag "use-proper-time-limits" to True recommended for all gym environments with time limit?

#259 PeixinC closed 3 years ago
1
Fix macOS fork safety issue

#258 ghost opened 3 years ago
0
Generates a sequence of objc errors on macOS Big Sur

#257 ghost opened 3 years ago
0
Running main.py in PyCharm, results in BrokenPipeError or EOFError

#256 ghost closed 3 years ago
0
modify readme for dependencies

#255 vwxyzjn closed 3 years ago
0
replace baselines dependencies

#254 vwxyzjn closed 3 years ago
3
assert 'NoFrameskip' in env.spec.id

#253 liuqi8827 closed 3 years ago
2
adaptive adam learning rate

#252 a-z-e-r-i-l-a opened 3 years ago
0
should h5py be listed as dependency?

#251 suliuzh opened 3 years ago
0
What can compute_grad_pen in gail.py do?

#250 ruleGreen opened 3 years ago
0
EOFError when entering a subprocess worker

#249 Artimisu opened 3 years ago
0
Massimo

#248 optimass closed 3 years ago
0