PPO implementation - Githubissues

kkuette / TradzQAI

Trading environment for RL agents, backtesting and training.

Apache License 2.0

165 stars, 47 forks

Hi man, could you please help me out with PPO? I "ported" q-trader to TensorFlow.js. It's painfully slow, and I'd like to try implementing a different policy for it. Could you please walk me through a simple PPO implementation for trading purposes (3 actions, 1 stock, etc.)?

I'm not a Python coder, and my Python reading skills are not that great, so I can't seem to understand your code well enough to implement it in tfjs.

How does the policy get updated, what parameters go into it, how much faster is it than the Q-function approach, etc.?

Thanks, Tiago
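
For reference on the "how does the policy get updated" part, here is a minimal sketch of the standard PPO clipped surrogate loss in plain TypeScript, with no tfjs dependency. It is not taken from TradzQAI; the names `softmax`, `Sample`, and `ppoClipLoss`, the three-action ordering (hold, buy, sell), and the `clipEps = 0.2` default are all assumptions made for illustration. In a real tfjs port the same arithmetic would presumably be written with tensors so an optimizer can differentiate it.

```typescript
// Illustrative sketch only: hypothetical names, not the TradzQAI code.

// Numerically stable softmax turning 3 action logits
// (assumed order: hold, buy, sell) into action probabilities.
function softmax(logits: number[]): number[] {
  const maxLogit = Math.max(...logits);
  const exps = logits.map((x) => Math.exp(x - maxLogit));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// One transition from a rollout.
interface Sample {
  oldProb: number;   // probability of the taken action under the policy that collected the data
  newProb: number;   // probability of the same action under the policy being optimized
  advantage: number; // estimated advantage of the taken action (e.g. return minus value baseline)
}

// PPO clipped surrogate loss, averaged over a non-empty batch.
// The probability ratio is clipped to [1 - clipEps, 1 + clipEps], and the
// smaller of the clipped and unclipped objectives is kept, so a single
// update cannot be rewarded for moving the policy far from the old one.
// The sign is flipped because optimizers minimize.
function ppoClipLoss(batch: Sample[], clipEps: number = 0.2): number {
  let total = 0;
  for (const s of batch) {
    const ratio = s.newProb / s.oldProb;
    const clippedRatio = Math.min(Math.max(ratio, 1 - clipEps), 1 + clipEps);
    total += Math.min(ratio * s.advantage, clippedRatio * s.advantage);
  }
  return -total / batch.length;
}
```

The inputs to the update are therefore the stored old action probabilities, the current network's probabilities for the same states and actions, and an advantage estimate per step. The same batch is typically reused for several gradient steps, which the clipping is meant to keep safe.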

PPO implementation #7