marlbenchmark off-policy issues - Githubissues

marlbenchmark / off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

MIT License

378 stars 67 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

mqmix hypernet b2

#17 zcyyyyyyyyyyy opened 7 months ago
0
Questions on the meaning of what wandb records

#16 ciel0906 opened 11 months ago
0
fix typo in README and small bug in clean_smac

#15 jason-huang03 opened 1 year ago
0
fix typo in README

#14 jason-huang03 closed 1 year ago
0
fix(wzl): fix vdn mixer to avoid Q value dim mismatch with reward

#13 zerlinwang opened 1 year ago
0
fix(wzl): change [] to nn.ModuleList in MDDPG_Critic to avoid differe…

#12 zerlinwang opened 1 year ago
0
Can you open-source MASAC code base?

#11 kailashg26 opened 1 year ago
0
Run time

#10 Bruce-Lan-ZY opened 1 year ago
1
RuntimeError: CUDA error: an illegal memory access was encountered

#9 b762927 opened 2 years ago
0
环境可视化问题

#8 Solister00 opened 2 years ago
2
MADDPG rewards are getting higher and higher

#7 ollehhello closed 2 years ago
1
Some questions about the code

#6 rainbow979 opened 2 years ago
1
Error with wandb

#5 Maxtoq closed 2 years ago
3
need a help

#4 zhouweiqing-star opened 2 years ago
2
Bug with idx_range, causing error with Prioritized ER

#3 Maxtoq opened 2 years ago
3
训练奖励越来越低？

#2 HorizonLiang closed 2 years ago
2
typo in train_mpe_maddpg.sh

#1 zbzhu99 closed 2 years ago
0