nexus-rl / project-x

0 stars 0 forks source link

Reward shaping #4

Open some-rando-rl opened 2 years ago

some-rando-rl commented 2 years ago

What should our rewards be?

How do we want to evaluate them?

Do we want to normalize rewards? What about advantage?