-
Remember that one benefit of policy gradient over Q-learning is that it can learn a stochastic policy, so we don't have to fine-tune the exploration schedule during training.
note that if we use softma…
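A minimal sketch of the idea above: with a softmax over action logits the policy is inherently stochastic, so exploration comes from sampling rather than from an epsilon schedule. This is an illustrative stdlib-only example, not code from the repo:

```python
import math
import random

def softmax_policy(logits, rng=random):
    """Sample an action index from a softmax (Boltzmann) distribution.

    Sampling from the softmax keeps the policy stochastic, so no
    separate epsilon-greedy exploration tuning is needed.
    """
    m = max(logits)                               # subtract max for stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # inverse-CDF sampling over the action probabilities
    r, acc = rng.random(), 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i, probs
    return len(probs) - 1, probs                  # guard against rounding

action, probs = softmax_policy([1.0, 2.0, 0.5])
```

Higher logits get proportionally higher sampling probability, but every action keeps nonzero probability, which is exactly why the policy keeps exploring on its own.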
-
Hello
Thanks for sharing your code.
Could you please link the papers your code is based on?
-
Can you give me some advice on running this code on a GPU and rendering the window from the original envs?
thanks
-
Hi
I was trying to run `python a3c_main.py --evaluate 2 --load saved/pretrained_model` to run inference with the pre-trained model. However, I got the following dimension error without changing…
-
I ran the notebook without any changes on the vizdoom environment. After around an hour the reward became non-negative and peaked at around 0.7, but continuing to run the code resulted in the reward g…
-
For example, when I run `a2c.py -r "runs/a2c/a2c_cartpole.ini"`, tons of errors pop up.
Regardless, I like that you've implemented a lot of algorithms and put them here. It's very useful for someone new…
-
This error happened on the **CARLA server** when I used Leaderboard and ScenarioRunner to create my A3C training environment. Strangely, it appeared a few hours after the start of training. Does anyon…
-
Do you have a version of the code for Python 3.x + TensorFlow 2.x? That would help me run it on a platform that does not have Python 2.7 + TensorFlow 1.1.0.
-
I have a custom environment where the total reward is the sum of intrinsic reward and environmental reward.
I've configured the environment to emit the reward breakdowns as:
`info = {'agent0' : {'…
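A setup like the one described can be sketched with a small wrapper that adds an intrinsic bonus to the environment reward and reports the breakdown through `info`. This is a hypothetical illustration; the class name, `intrinsic_fn`, and the `reward_breakdown` key are all made up for the sketch and are not from the original code:

```python
class RewardBreakdownWrapper:
    """Illustrative wrapper: total reward = environmental + intrinsic,
    with the per-component breakdown emitted in `info`."""

    def __init__(self, env, intrinsic_fn):
        self.env = env
        self.intrinsic_fn = intrinsic_fn  # maps observation -> intrinsic bonus

    def step(self, action):
        obs, env_reward, done, info = self.env.step(action)
        intrinsic = self.intrinsic_fn(obs)
        # report the breakdown so training code can log each component
        info['reward_breakdown'] = {'intrinsic': intrinsic,
                                    'environmental': env_reward}
        return obs, env_reward + intrinsic, done, info
```

The agent then trains on the summed reward while the logger reads `info['reward_breakdown']` to track the two components separately.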
-
Hi alifanov, thanks for sharing your code, it's a very good example.
I'm trying to train simple-EC, but training feels very slow. Could it use more CPUs to train EC synchronously?
T…