hi,i am trying to reproduce the battle game (red army use MFQ and blue army use DQN.) i trained the model and find both algorithums has a negative reward.and the agents number of two side has not decreased in the battle which is confused.can u help me?
hi,i am trying to reproduce the battle game (red army use MFQ and blue army use DQN.) i trained the model and find both algorithums has a negative reward.and the agents number of two side has not decreased in the battle which is confused.can u help me?