Closed: zacheberhart closed this issue 6 years ago
I went through a lot of revisions, saving them as git commits. The best I got was a 25X return on the training data, which disappeared on the test data.
The current example uses PPO, which doesn't seem to do very well. Does that make sense?
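For reference, here is a minimal, self-contained sketch of that train/test check, assuming stable-baselines3 (>= 2.0, Gymnasium API) and a toy random-walk price series. None of this is the environment or agent from this repo; it just illustrates how a high training reward can vanish on held-out data:

```python
# Illustrative sketch only -- the env, data, and hyperparameters here are
# hypothetical, not the ones from this repo. Assumes stable-baselines3 >= 2.0.
import numpy as np
import gymnasium as gym
from gymnasium import spaces
from stable_baselines3 import PPO
from stable_baselines3.common.evaluation import evaluate_policy


class ToyTradingEnv(gym.Env):
    """Flat/long decisions over a fixed price series; reward = position * log return."""

    def __init__(self, prices):
        super().__init__()
        self.prices = np.asarray(prices, dtype=np.float32)
        self.action_space = spaces.Discrete(2)  # 0 = flat, 1 = long
        self.observation_space = spaces.Box(-np.inf, np.inf, shape=(1,), dtype=np.float32)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        self.t = 0
        return self._obs(), {}

    def _obs(self):
        # Observation: the most recent one-step log return (zero at t=0).
        r = 0.0 if self.t == 0 else np.log(self.prices[self.t] / self.prices[self.t - 1])
        return np.array([r], dtype=np.float32)

    def step(self, action):
        # Reward: next-step log return if long, zero if flat.
        ret = np.log(self.prices[self.t + 1] / self.prices[self.t])
        reward = float(action) * float(ret)
        self.t += 1
        terminated = self.t >= len(self.prices) - 1
        return self._obs(), reward, terminated, False, {}


# Synthetic random-walk prices; fit on the first 75%, hold out the rest.
rng = np.random.default_rng(0)
prices = 100.0 * np.exp(np.cumsum(rng.normal(0.0, 0.01, size=2000)))
train_env = ToyTradingEnv(prices[:1500])
test_env = ToyTradingEnv(prices[1500:])

model = PPO("MlpPolicy", train_env, verbose=0)
model.learn(total_timesteps=50_000)

# If the agent has only memorized the training series, train reward stays
# high while test reward collapses toward zero -- the "disappearing 25X".
train_r, _ = evaluate_policy(model, train_env, n_eval_episodes=3)
test_r, _ = evaluate_policy(model, test_env, n_eval_episodes=3)
print(f"mean episode reward -- train: {train_r:.3f}, test: {test_r:.3f}")
```

A large gap between those two numbers is the overfitting signature described above, independent of whether the algorithm is VPG or PPO.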
I'm a bit confused by your notes on performance. In the README, you mention that VPG underperforms the baseline on the test set. But in the notebook, you mention getting an annualized 25X return using PPO -- which doesn't appear to be the case, judging by the test-performance plot at the bottom of the notebook.
I'm currently running my own implementation and waiting for it to finish training, but I'm curious -- were you able to get this working? I know the PPO algorithm is fairly new, so maybe those notes were left over from your initial testing? Any clarification would be helpful :)
Thanks!