acktr Search Results - Githubissues

319 results
for acktr

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ikostrikov/pytorch-a2c-ppo-acktr-gail #271

Error after python main.py --env-name "PongNoFrameskip-v4"

hello, i get this error, when i run code "python main.py --env-name "PongNoFrameskip-v4" i don't know what happed, my env is: python3.6.3 Package Version ----------------- ------- a…

FulChou updated 1 year ago
3
lcswillems/rl-starter-files #13

Support for MiniWorld (3D indoor environment)?

Hi Lucas, I've been working on my 3D indoor environment. It's still very basic, but it works, and I just made the repository public: https://github.com/maximecb/gym-miniworld I've tried to adjus…

maximecb updated 5 years ago
36
ikostrikov/pytorch-a3c #46

Can't work on Ubuntu 16.04

After value, logit, (hx, cx) = model((Variable(state.unsqueeze(0)),(hx, cx))) in train.py, the program doesn't go on. Do you have any idea?

caozhenxiang-kouji updated 1 year ago
22
RL4VLM/RL4VLM #19

Code question

Hello, thank you very much for your wonderful work @YX-S-Z ! I have a question about the COT splitting in RL. In particulaire, in the fragment where you split my the sequence of tokens, shouldn't t…

Serega6678 updated 2 months ago
4
chris-chris/pysc2-examples #4

train mineral shard example is not compatible with baselines…

I can't get started train_minteral_shards.py example. Getting this error: $ python train_mineral_shards.py Traceback (most recent call last): File "train_mineral_shards.py", line 14, in …

snurkabill updated 5 years ago
13
kengz/SLM-Lab #383

Real recurrent policy supported

**Are you requesting a feature or an implementation?** To handle the partial MDP task, the recurrent policy is currently quite popular. We need to add a lstm layer after the original conv (or mlp) …

yangysc updated 5 years ago
2
google-deepmind/pysc2 #88

I have some questions about pysc2 baseline agent of Deepmind…

I have some questions about pysc2 baseline agent of Deepmind. https://deepmind.com/blog/deepmind-and-blizzard-open-starcraft-ii-ai-research-environment/ I'm trying to implement baseline agent il…

chris-chris updated 6 years ago
8
alexfrom0815/Online-3D-BPP-DRL #6

RuntimeError: symeig_cuda: the algorithm failed to converge

Traceback (most recent call last): File "main.py", line 258, in main(args) File "main.py", line 42, in main train_model() File "main.py", line 209, in train_model value_loss, …

nimisha-stellosys updated 3 months ago
5
alexfrom0815/Online-3D-BPP-DRL #22

How to run the code with a2c

Since choosing the default acktr algorithm results in “RuntimeError: symeig_cuda: the algorithm failed to converge”, I chose to run the a2c algorithm, but it still stops with an error. The command to …

SylvanHuang updated 3 months ago
2
RealVNF/distributed-drl-coordination #8

Questions about parameter settings and code

Hello Stefan! We are happy to find your interesting work, and we are conducting further experiments based on your project, I met some problems during the implementation and I carefully read your paper…

daiwenlong23 updated 1 month ago
7

上一页 1...5 6 7 8 9 10 11...32 下一页

319 results for acktr

319 results
for acktr