PacktPublishing / Deep-Reinforcement-Learning-Hands-On

Hands-on Deep Reinforcement Learning, published by Packt

Chapter 8: run_model #4

Closed izabael closed 5 years ago

izabael commented 5 years ago

I'm confused about this code for two reasons: 1) Is position_steps actually used for anything? It's always None and seems to do nothing. 2) Related to that: when I run the code and add some print statements to see which actions it chooses (0, 1, 2), it seems the agent can buy as many shares as it wants at a time. How can I change the code so that it only buys one share at a time, as in the training code? (A sketch of the kind of print statement I mean is below.)

[I MUST add that I love this book and your code examples. I've spent so much enjoyable time working through the Atari implementations especially. This book is becoming a bible to me.]

Thank you!
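For concreteness, here is a minimal sketch of the kind of instrumentation described above. It is illustrative only: the tensor name and values are made up, not taken from the actual run_model.py, and it simply assumes a greedy choice over Q-values with actions 0 = Skip, 1 = Buy, 2 = Close as in the chapter's environment.

```python
import torch

# Illustrative only: suppose out_v holds the network's Q-values for one observation.
out_v = torch.tensor([[0.05, 0.90, 0.10]])

action_idx = out_v.max(dim=1)[1].item()   # greedy action selection
print("chosen action:", action_idx)       # -> 1 (Buy) for this made-up example
```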

Shmuma commented 5 years ago

Hi!

  1. The variable position_steps is a leftover from an experiment I did while developing the example. In the current version it is meaningless and will be removed.

  2. The agent can issue as many Buy actions as it wants, but the environment ignores such actions if the agent has already entered the market. This check is implemented here: https://github.com/PacktPublishing/Deep-Reinforcement-Learning-Hands-On/blob/master/Chapter08/lib/environ.py#L92

I haven't checked, but common sense suggests the agent should learn this and stop sending the Buy action a second time (the flag indicating an open position is provided in the observation). Early in training, though, the agent may send the action many times, but only the first one is taken into account. Of course, this example is very basic and could be extended with a more sophisticated environment model: stop losses, take profits, margin calls, short orders, etc. A simplified sketch of the check is shown below.
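The sketch that follows is not the code from environ.py, just a stripped-down illustration of the same idea: a Buy action only opens a position when none is open, so repeated Buy actions fall through and change nothing. The class and attribute names here are illustrative.

```python
from enum import Enum

class Actions(Enum):
    Skip = 0
    Buy = 1
    Close = 2

class SimpleState:
    """Minimal sketch: extra Buy actions are ignored while a position is open."""

    def __init__(self):
        self.have_position = False
        self.open_price = 0.0

    def step(self, action: Actions, current_price: float) -> float:
        reward = 0.0
        if action == Actions.Buy and not self.have_position:
            # Only the first Buy opens a position; later Buy actions do nothing.
            self.have_position = True
            self.open_price = current_price
        elif action == Actions.Close and self.have_position:
            # Reward here is simply the percentage profit of the closed position.
            reward = 100.0 * (current_price - self.open_price) / self.open_price
            self.have_position = False
            self.open_price = 0.0
        return reward
```

So to get "only one share at a time" behaviour, nothing extra is needed at run time: the second and later Buy actions are no-ops until the position is closed.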

Thanks for your interest in the book!