issues
search
crowdAI
/
marLo
Multi Agent Reinforcement Learning using MalmÖ
MIT License
245
stars
46
forks
source link
Reward structure adjustment for basic sp task
#28
Closed
rdgain
closed
6 years ago
rdgain
commented
6 years ago
Rewards scaled to fit between [-1, 1].
Rewards scaled to fit between [-1, 1].