qmix Search Results - Githubissues

starry-sky6688/MARL-Algorithms #115

关于在别的环境应用qmix出现episodes rewards dropout的问题

作者您好，首先感谢您优质的开源项目，代码易读性很高，且迁移到其他的环境非常方便。我将此算法迁移到了highway-env来完成我的项目，但是我始终遇到下面图片这个问题，就是算法先收敛到一个比较好的值，但是持续一会就下降并且震荡幅度很大。我想问一下解决这个问题是提前停止训练（在已经有比较好的结果的时候）吗？我现在在使用默认的2000000步，您在SMAC中遇到过相关的问题吗？ …

ayton-zhang updated 4 months ago

ray-project/ray #9041

[rllib] Error when exporting multiple policies not named def…

### What is the problem? When trying to export models with multiple policies specified using `export_formats=` in `tune.run()`, the programs gives this error about not having a policy named `default_…

rpandya922 updated 5 months ago

f205-ml-cv-lab/weekly-report-for-all-members- #1

2019-0609~2019-0615

ppcd401d2 updated 5 years ago

instadeepai/og-marl #27

Performance drop bewteen offline and online

Thanks so much for your work, I find it very helpful. I am confused with a problem, that I trained with qmix+cql,startcraft v1, 3m scneario, firstly offline and then online. I comment out the trai…

zhonghai1995 updated 5 months ago

ray-project/ray #13405

[rllib] I can't use QMixTrainer trainer.compute_action to ob…

I took TwoStepGame for an example. My code is as follows. I have tried different obs from TwoStepGame or TwoStepGame.with_agent_groups. It always obtained different errors. Can any one tell me how …

imxuemei updated 5 months ago

thu-ml/tianshou #121

How to support multi-agent reinforcement learning

- [ ] I have marked all applicable categories: + [ ] exception-raising bug + [ ] RL algorithm bug + [ ] documentation request (i.e. "X is missing from the documentation.") + [x] ne…

youkaichao updated 1 year ago

ray-project/ray #10510

[rllib] COMA implementation for multi-agent environments

To expand the multi-agent RL algorithms, RLlib would greatly benefit from an implementation of the counterfactual multi-agent policy gradients (COMA) algorithm from Foerster et al. (https://arxiv.org/…

rallen10 updated 5 months ago

zyh1999/CADP #1

KeyError: 'unit_dim'

Hello author, I got this error when I ran the code to replicate the results: marl_env/lib/python3.8/site-packages/gym/logger.py:30: UserWarning: WARN: Could not seed environment warnings.warn…

Isaac009 updated 6 months ago

opendilab/DI-engine #790

how to get the ckpt file?

FileNotFoundError: [Errno 2] No such file or directory: '../exp/MMM/qmix/1/ckpt_BaseLearner_Wed_Jul_14_22_16_56_2021/iteration_9900.pth.tar'

SiriusZbz updated 7 months ago

starry-sky6688/MARL-Algorithms #109

关于qtran的问题

作者您好！我最近在尝试利用您编写的qmix、qtran_base和qtran_alt算法解决无人机路径规划问题。在我做实验的过程中，从实际的表现上来看，qmix算法的性能要高于qtran_base（可能是我的参数设定出现了一些问题）。所以我想请问一下您在实验中qtran算法的实际性能表现是否比qmix要好呢，如果好的话大概好多少此外，qtran_alt是不是比qtran_base难收敛…

Johnson221b updated 8 months ago

343 results for qmix

343 results
for qmix