Open JingdiC opened 3 years ago
Hi JingdiC - Multi-agent reinforcement learning is highly unstable and exhibits large variation between runs and chaotic feed-back loops, and often fails through random variation to get a good initialisation. Try running it multiple times (e.g. 5-10 times) with different initialisation seeds and saving the run that performs the best.
Hi , I run your train_agent file, but I only got 0.01 average reward after 3000 episodes run. Did not get the same results you said. I don't know why.