NVIDIA / TorchFort

An Online Deep Learning Interface for HPC programs on NVIDIA GPUs
https://nvidia.github.io/TorchFort/
Other
154 stars 19 forks source link

Tkurth/rl ppo #12

Closed azrael417 closed 4 months ago

azrael417 commented 6 months ago

This PR refactors the RL code for more flexibility. This was necessary for adding on-policy routines, which is also done with PPO along with all C and Fortran wrapper routines. So far, the code still needs to be tested but I wanted to get going on the PR.