godka / Pensieve-PPO

The simplest implementation of Pensieve (SIGCOMM '17) via state-of-the-art RL algorithms, including PPO, DQN, and SAC, with support for both TensorFlow and PyTorch.
https://godka.github.io/Pensieve-PPO/
BSD 2-Clause "Simplified" License
65 stars, 31 forks