-
Hi! First and foremost, fantastic work! I'm trying to replicate the performance shown in the paper for the Q-Learning baselines locally; however, using the exact versions provided in your requirements…
-
First, I ran a command like this:
`python3 src/main.py --config=qmix_smac --env-config=sc2 with env_args.map_name=3m save_replay=True save_model=True`
and then I loaded the model by running:…
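The loading command is cut off above. For reference, a hedged sketch of what checkpoint loading typically looks like in pymarl, using the `checkpoint_path`, `evaluate`, and `save_replay` options from its default config (the `<run_dir>` placeholder is illustrative, not the poster's actual path):

```
python3 src/main.py --config=qmix_smac --env-config=sc2 with env_args.map_name=3m checkpoint_path=results/models/<run_dir> evaluate=True save_replay=True
```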
-
Hey everyone,
I'm trying to run Ape-X with tune.run() on Ray 1.3.0, and the trial status remains "PENDING". I get the same message indefinitely:
```
== Status ==
Memory usage on this node: 7.5/19.4 GiB
Using…
```
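For what it's worth, a trial that stays PENDING usually means its aggregate resource request (driver plus all Ape-X rollout workers) exceeds what the node offers. A minimal sketch of shrinking the request, assuming RLlib's APEX trainer and an illustrative environment:

```python
import ray
from ray import tune

ray.init()

# Ape-X defaults to many rollout workers (32 in RLlib 1.3.0); if the node
# has few CPUs, the combined CPU request can never be satisfied and the
# trial stays PENDING forever. Shrinking the request lets it schedule.
tune.run(
    "APEX",
    config={
        "env": "CartPole-v0",  # illustrative environment
        "num_workers": 2,      # keep <= available CPUs minus the driver
        "num_gpus": 0,         # request no GPU if none is available
    },
    stop={"training_iteration": 1},
)
```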
-
Hello,
Thanks for the tf2.x implementation of MADDPG. Appreciate your effort.
I was wondering if you know why PER MADDPG performs worse than vanilla MADDPG in terms of reward. I test…
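The excerpt is truncated, so this is only a guess at a common culprit: prioritized replay skews the sample distribution and needs importance-sampling correction, or it can underperform uniform replay. A minimal sketch of the PER probabilities and weights from Schaul et al. (2016), with illustrative names and defaults:

```python
import numpy as np

def per_sampling(priorities, alpha=0.6, beta=0.4):
    """Sampling probabilities and importance-sampling weights for PER.

    Transitions are drawn with P(i) = p_i^alpha / sum_j p_j^alpha; each
    sampled TD error must then be scaled by w_i = (N * P(i))^-beta,
    normalized by max(w), to keep the gradient unbiased in expectation.
    Skipping these weights in the critic loss overrepresents
    high-priority transitions and can hurt the Q estimates.
    """
    p = np.asarray(priorities, dtype=np.float64) ** alpha
    probs = p / p.sum()
    w = (len(p) * probs) ** (-beta)
    return probs, w / w.max()

probs, weights = per_sampling([0.5, 2.0, 0.1, 1.0])
```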
-
- [ ] I have marked all applicable categories:
  + [ ] exception-raising bug
  + [ ] RL algorithm bug
  + [ ] system worker bug
  + [ ] system utils bug
  + [ ] code design/refactor
  …
-
Hi, when I run `python idqn.py`, `vdn.py`, or `qmix.py`, there is a runtime error:
"RuntimeError: cannot perform reduction function argmax on a tensor with no elements because the operation does not have …
-
I have been training a StyleGAN2 model for the past month and hadn't had any issues until yesterday, when this started occurring:
`cannot import name 'notf' from 'tensorboard.compat'`
Any solutions on the way?
…
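Not a fix, but a commonly reported cause of this import error is a version mismatch between TensorFlow and TensorBoard (an assumption here, since the excerpt is truncated); checking which versions are co-installed is a quick first step:

```python
# Print the installed versions; a tensorflow/tensorboard major-version
# mismatch is a commonly reported trigger for this import error.
import tensorboard
import tensorflow as tf

print("tensorflow:", tf.__version__)
print("tensorboard:", tensorboard.__version__)
```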
-
It would be nice to be able to pull multiple quantiles from a single distribution at the same time.
```r
dist
```
-
Hi author, I'd like to ask: in the code, does `reuse_network = True` mean that every agent shares a single agent network? If so, could that end up making every agent produce the same action?
I ask because when I use the QMIX algorithm in an environment I built myself, the overall reward gets worse and worse over training. Also, after setting epsilon to 0, the agents seem to converge on choosing the same action, and that identical action is exactly what incurs a large penalty in the environment. I really can't figure out…
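For context, a sketch of the usual reason a shared network does not force identical actions, assuming a pymarl-style setup: each agent's input is its own observation, typically concatenated with a one-hot agent ID (and last action), so the shared network can still produce different Q-values per agent. If the agent ID is omitted from the inputs, agents with similar observations will indeed collapse onto the same action.

```python
import torch

def build_agent_inputs(obs, last_actions, n_agents):
    """Per-agent inputs for a shared (reused) Q-network.

    obs:          [n_agents, obs_dim] local observations
    last_actions: [n_agents, n_actions] one-hot previous actions

    A one-hot agent ID is appended so the shared network can still
    specialize per agent; without it, agents with similar observations
    tend to pick the same greedy action.
    """
    agent_ids = torch.eye(n_agents)  # one-hot ID per agent
    return torch.cat([obs, last_actions, agent_ids], dim=-1)
```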