a3c Search Results - Githubissues

1000+ results
for a3c

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

datawhalechina/easy-rl #40

/chapter7/chapter7

https://datawhalechina.github.io/easy-rl/#/chapter7/chapter7 Description

qiwang067 updated 9 months ago
11
devsisters/DQN-tensorflow #61

Any one who can share model details?

class M1(DQNConfig): backend = 'tf' env_type = 'detail' action_repeat = 1 class M2(DQNConfig): backend = 'tf' env_type = 'detail' action_repeat = 4 I use python m…

Richardxxxxxxx updated 6 years ago
3
dennybritz/reinforcement-learning #30

DQN solution results peak at ~35 reward

Hi Denny, Thanks for this wonderful resource. It's been hugely helpful. Can you say what your results are when training the DQN solution? I've been unable to reproduce the results of the DeepMind p…

nerdoid updated 7 months ago
85
junxiaosong/AlphaZero_Gomoku #26

采用人工对战数据加快收敛可行性分析

您好我现在想采用人工对战的数据用于加速收敛，请问这里人工对战的话，训练时候的mcts_probs_batch概率该如何设定呢，可否让采取当前action的概率为1 其他为0？

apple1987 updated 1 year ago
6
carla-simulator/reinforcement-learning #14

ERROR: (localhost:2000) failed to read data: timed out

Hi, I'm using trying out this code in windows. I always get this error : ERROR: (localhost:2000) failed to read data: timed out. This is the error trace. runfile('C:/Users/cvaram/Documents/CARLA_0.9.…

Chaivara updated 3 years ago
4
ray-project/ray #21917

[Bug] "The kernel has died..." during Ray tune.run

### Search before asking - [x] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues. ### Ray Component Ray Core, Ray Tune ### What happened + What you ex…

gendrelom updated 2 years ago
1
xiaobaishu0097/ICLR_VTNet #2

可以提供一下文章中已经训练好的模型吗？

你好，非常感谢您能分享代码，但是我们在训练结果性能较低，请问可以提供一下您训练好的模型吗？想做进一步的测试，非常感谢。

colinzhaoxp updated 2 months ago
11
google-deepmind/pysc2 #64

Tutorials

Not sure if you are interested but I have written a tutorial for building a basic agent: https://medium.com/@skjb/building-a-basic-pysc2-agent-b109cde1477c https://medium.com/@skjb/building-a-smar…

skjb updated 5 years ago
16
Lightning-Universe/lightning-bolts #280

Shared replay buffer

## 🚀 Feature The RL implementations added do not have the num_workers option. I have a feeling this is because the code doesn't support a shared replay buffer. ### Motivation Adding this would e…

MihaiAnca13 updated 3 years ago
3
ikostrikov/pytorch-a3c #46

Can't work on Ubuntu 16.04

After value, logit, (hx, cx) = model((Variable(state.unsqueeze(0)),(hx, cx))) in train.py, the program doesn't go on. Do you have any idea?

caozhenxiang-kouji updated 1 year ago
22

上一页 1...29 30 31 32 33 34 35...100 下一页

1000+ results for a3c

1000+ results
for a3c