tjuHaoXiaotian / GASIL

Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
31 stars 6 forks source link

_update_nomal() in gasil.py line 522 #1

Open miyunluo opened 4 years ago

miyunluo commented 4 years ago

Dear author, thanks for open source your code.

I notice there is a function _update_normal() in gasil.py line 522, which uses the normal replay buffer to do q_train() and p_train(). However, this step seems not appear in the Algorithm 1 in your AAMAS2019 paper. It seems Algorithm 1 only updates the imitation part.

Am I right or I missing something? Could you explain why do _update_normal() before imitation? Thank you very much.