Closed by lingjunz 4 years ago
Hi there! Thanks for the detailed bug report. This looks like an issue with the OpenAI Baselines PPO agent rather than with bsuite itself, which puts it outside the scope of bsuite.
I do notice you mention that you're using TF2, but as far as I can tell, the OpenAI baselines require TF 1.x to run -- could this be part of the issue?
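If it helps, here is a quick way to check which TensorFlow series is installed in the environment. This is only a minimal sketch: `is_tf1` is a hypothetical helper written for this example, and it assumes the package was installed under the distribution name `tensorflow` (a variant such as `tensorflow-gpu` would need that name instead).

```python
from importlib.metadata import PackageNotFoundError, version


def is_tf1(version_string: str) -> bool:
    """Return True if a version string such as '1.15.0' belongs to the 1.x series."""
    return version_string.split(".")[0] == "1"


try:
    # Assumption: the distribution is named "tensorflow".
    installed = version("tensorflow")
except PackageNotFoundError:
    installed = None

if installed is None:
    print("TensorFlow does not appear to be installed in this environment")
elif is_tf1(installed):
    print(f"TensorFlow {installed} is in the 1.x series")
else:
    print(f"TensorFlow {installed} is not in the 1.x series")
```

If the last message is printed, the version mismatch is worth ruling out before digging further into the traceback.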
I ran into a small problem when building the OpenAI Baselines PPO agent in the bsuite_tutorial.
output input shape is (1, 1)
AssertionError Traceback (most recent call last)