What's the meaning of return_mean and return_std? / What's the function of rnn agent?

oxwhirl / pymarl

Python Multi-Agent Reinforcement Learning framework

Apache License 2.0

1.89k stars 386 forks source link

Hi, thanks for this repo! I have been reading the source code pf pymarl and I have a few questions.

In the output of the program, there are a few parameters like return_mean. I understand most of them but I have trouble understanding return_mean and return_std. What's the meaning of return? (I guess may be the calculation of value function.) And how do you calculate returnalong with return_mean and return_std?

The other question is "why do we use rnn agent?". When I search the word rnn in this repo, I didn't find codes about how the rnn is used in training agents. And when we use algorithms like qmix, is the system still using rnn agent or the system use qmix agent(like overwriting rnn agent).

Thanks again for this repo!

oxwhirl / pymarl

What's the meaning of return_mean and return_std? / What's the function of rnn agent? #10