ppo Search Results - Githubissues

1000+ results
for ppo

Best match


linyiLYi/snake-ai #2

Testing works, but training fails with an error

(SnakeAI) E:\snake-ai-master\main>python train_cnn.py Using cuda device Wrapping the env in a VecTransposeImage. Process SpawnProcess-5: Traceback (most recent call last): File "C:\Users\KEN202…

aijunzhao updated 1 year ago
18
xiang578/xiang578.github.io #88

Hung-yi Lee Reinforcement Learning Course Notes | 算法花园 (Algorithm Garden)

https://xiang578.com/post/reinforce-learnning-basic.html Info Slides download: Hung-yi Lee - Deep Reinforcement Learning Course videos: DRL Lecture 1: Policy Gradient (Review) - YouTube Change Log 20191226: Organized PPO-related …

xiang578 updated 4 years ago
1
gijskoning/Reproducibility_project #1

research first model

Might be good to first start with only the FNN. Also found out that it is better to start working with the Traffic Control environment since that model is a lot smaller.

gijskoning updated 3 years ago
1
google-deepmind/mujoco #1636

Error when running training_apg.ipynb: ValueError: safe_zip(…

Hi, I'm a student trying out MJX for some projects. I was looking at [training_apg.ipynb](https://github.com/google-deepmind/mujoco/blob/main/mjx/training_apg.ipynb) and tried running it on my comp…

charles-zhng updated 1 month ago
5
X-Sharp/XSharpPublic #1487

Problem with Extended Expression Match Marker (Xbase++)

**Describe the bug** Unfortunately, the [problem](https://github.com/X-Sharp/XSharpPublic/issues/1073) with the extended expression match marker isn't resolved. **To Reproduce** ``` #translate P…

DenGhostYY updated 13 hours ago
3
BarisYazici/deep-rl-grasping #29

PPO algo performs badly

The PPO algorithm has difficulty converging, and the gripper always moves up and away from the table. Could you please give me some hints about it? I sincerely appreciate it!

HarrisonC7 updated 1 month ago
2
lefnire/tforce_btc_trader #11

Try ray/RLlib

[Update 2018-07-27] Update: seems Coach has slowed down (w/o much community), and rllab has stopped. A more recently popular framework is [rllib](http://ray.readthedocs.io/en/latest/rllib.html) (one l…

lefnire updated 5 years ago
2
DLR-RM/stable-baselines3 #914

Supporting PyTorch GPU compatibility on Apple Silicon chips

### 🚀 Feature PyTorch recently released support for GPU acceleration using the Apple Silicon chips. This should be supported in stable-baselines3 by the `"mps"` device (I believe). ### Minimal E…

ryanrudes updated 6 months ago
17
dennybritz/reinforcement-learning #238

Reinforcement learning policy

I want to make a project using reinforcement learning in which a bot sends scams to other bots on social media, and the other bots detect the scams and reject them. I think it needs a deep reinforcement learning…

Comp-Engr18 updated 2 months ago
1
moripiri/DiverseRL #32

configuration management + training flow change

expected outcome
```
from diverserl.algos import PPO

if __name__ == '__main__':
    args = get_args()
    algo = PPO(**args)
    algo.train()
```
use hydra

moripiri updated 3 weeks ago
1
