Open ohmahgawditbob opened 3 years ago
Really, TRPO only needs the following from a gym env:
env.reset()
env.render()
)env.step(action.flatten()'
We just need to make some functions to replace these function calls.
Phew! https://github.com/LouisFoucard/gym-blender will allow me to not have to completely demolish the TRPO code. That was a close one.
This will be an especially interesting task since i believe it was originally made for OpenAI gym sessions, which I do not think we should try to cobble together a data structure to spoof a Gym.
We will need to:
This list is likely to change once discussion opens, bringing out all of the mindless things I had stated in the original To-do list.