Kaixhin / imitation-learning

Imitation learning algorithms
MIT License
406 stars, 39 forks

How to judge the convergence #8

Closed (ghost closed this issue 2 years ago)

ghost commented 2 years ago

Hello!

Thanks so much for sharing the code!

I am new to inverse reinforcement learning. I am trying to apply AIRL and GAIL to a custom environment whose reward function is unknown. Are there any metrics, other than rewards, that can be used to judge convergence?

Thanks ;).

Kaixhin commented 2 years ago

Yes, although this is difficult. The paper "Hyperparameter Selection for Imitation Learning" proposes some metrics that do not require the environment reward.
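
As a rough illustration of what a reward-free proxy can look like, here is a minimal sketch of one common choice: the mean squared error between the policy's actions and the expert's actions on held-out demonstration states. Tracking it over training gives a signal that does not depend on the environment reward. This is not code from this repository, and the names (`heldout_action_mse`, `policy`, the toy data) are hypothetical. It also assumes continuous actions and a deterministic policy callable; check the paper for the exact metrics it evaluates.

```python
from typing import Callable, Sequence

State = Sequence[float]
Action = Sequence[float]


def heldout_action_mse(
    policy: Callable[[State], Action],
    states: Sequence[State],
    expert_actions: Sequence[Action],
) -> float:
    """Mean squared error between policy and expert actions.

    `states` and `expert_actions` should come from demonstrations that
    were NOT used for training, so the value reflects generalisation.
    The average is taken over every action dimension of every sample.
    """
    if len(states) == 0 or len(states) != len(expert_actions):
        raise ValueError("need equally many (and at least one) states and actions")

    total = 0.0
    count = 0
    for state, expert_action in zip(states, expert_actions):
        policy_action = policy(state)
        if len(policy_action) != len(expert_action):
            raise ValueError("policy and expert action dimensions differ")
        for p, e in zip(policy_action, expert_action):
            total += (p - e) ** 2
            count += 1
    return total / count


# Toy usage with a hypothetical 1-D linear policy and made-up held-out data.
def toy_policy(state: State) -> Action:
    return [0.5 * state[0]]


heldout_states = [[0.0], [1.0], [2.0]]
heldout_expert_actions = [[0.0], [0.4], [1.1]]
print(heldout_action_mse(toy_policy, heldout_states, heldout_expert_actions))
```

A value that stops decreasing (or starts rising) across checkpoints is only a hint of convergence or overfitting, not a guarantee of good returns, so it is best treated as one signal among several.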