State Action Encoding in Critic

shariqiqbal2810 / MAAC

Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019

MIT License

645 stars 169 forks source link

I was going through your code and I am having difficult time understanding 1 part of the critic. If you see line https://github.com/shariqiqbal2810/MAAC/blob/1006cffb61e6043872a27956635e199b96b910b2/utils/critics.py#L148

you are just using a state encoding along with the joint embedding of all state-action pair of other agents as an input to the Q-function. If I recall correctly equation 5 takes an embedding of the current agents state-action pair alongside with joint embedding. Can you please explain what is going on here?

shariqiqbal2810 / MAAC

State Action Encoding in Critic #5