marlbenchmark / off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
MIT License
386 stars, 67 forks

Some questions about the code #6

Open rainbow979 opened 2 years ago

rainbow979 commented 2 years ago

Why does the code require only one env when using an RNN policy? https://github.com/marlbenchmark/off-policy/blob/release/offpolicy/scripts/train/train_mpe.py#L154

Solister00 commented 2 years ago

I'd like to ask the same question.