crowdAI / marLo

Multi Agent Reinforcement Learning using MalmÖ
MIT License
245 stars 46 forks source link

Reward structure adjustment for basic sp task #28

Closed rdgain closed 6 years ago

rdgain commented 6 years ago

Rewards scaled to fit between [-1, 1].