Closed franvf closed 4 years ago
Hi, This is pretty similar to the setup in the Basic scene (with slightly different reward values): https://github.com/Unity-Technologies/ml-agents/blob/a6e8a64fd80ccd7a8040252695ff6575a88eb8bb/UnitySDK/Assets/ML-Agents/Examples/Basic/Scripts/BasicAgent.cs#L50-L62 So you might want to check that you're doing the same thing there.
You might also want to check the tensorboard results https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Using-Tensorboard.md to track that the reward gradually increases over time.
This issue has been automatically marked as stale because it has not had activity in the last 14 days. It will be closed in the next 14 days if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed because it has not had activity in the last 28 days. If this issue is still valid, please ping a maintainer. Thank you for your contributions.
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
Hi everyone, I work a little with mlagents for make a game where an agent touch one cube, if touch the correct cube done() and reward +1, but if touch the incorrect cube (there is mor than one cube) I call AgentReset() and reward -1.
Well, the problem it's that my agent "dont learn" he touch the incorrect cube 9/10 times, but when touch the correct cube the final reward is positive, I guess that AgentReset() not only reset the scenario of game, but also reset the rewards, and when touch the bad cube 9 times don't acumulate in the total of reward.
How can I acumulate this total rewards?
THX!