Tkurth/rl ppo - Githubissues

NVIDIA / TorchFort

An Online Deep Learning Interface for HPC programs on NVIDIA GPUs

https://nvidia.github.io/TorchFort/

Other

154 stars 19 forks source link

Tkurth/rl ppo #12

Closed azrael417 closed 4 months ago

azrael417 commented 6 months ago

This PR refactors the RL code for more flexibility. This was necessary for adding on-policy routines, which is also done with PPO along with all C and Fortran wrapper routines. So far, the code still needs to be tested but I wanted to get going on the PR.