[Feature Request] More general reward

facebookresearch / mbrl-lib

Library for Model Based RL

MIT License

945 stars 154 forks source link

Hi @mkolodziejczyk-piap. This is an interesting suggestion. Can you give a more concrete example to help me sketch out something?

As a starting point, in the current state of the code it should already be possible to use a reward_fn that is a class, as long as you implement a __call__ method with the same inputs (actions and next observation). This will allow you to keep some internal state, but depending on how you'd to use the ModelEnv you may need to have your own version of evaluate_action_sequences.

I'm happy to take a deeper look at this with more details in hand.

facebookresearch / mbrl-lib

[Feature Request] More general reward #115