llSourcell / Reinforcement_Learning_for_Stock_Prediction

This is the code for "Reinforcement Learning for Stock Prediction" by Siraj Raval on YouTube

Resolved issue around inability to evaluate and overflow in sigmoid. Also added a few lines that I missed in merge last night. #8

Open xtr33me opened 6 years ago

xtr33me commented 6 years ago

I was always getting a profit of 0 when evaluating the model. This was primarily because a "Buy" never occurred, so agent.inventory stayed empty. I changed it so that a buy is forced on the first iteration, and the model picks up from there. In a future adjustment we could infer the best time to buy from the sliding window, or by some other means. For now, this at least allows evaluation to run on other datasets.
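A minimal sketch of the idea, not the exact diff: `act` and `prices` are stand-ins for the repo's `agent.act` and `getStockDataVec`, and the action codes (0 = sit, 1 = buy, 2 = sell) follow the original codebase.

```python
def evaluate(act, prices):
    """One evaluation pass that forces a buy on the first step.

    act: policy stand-in mapping a time index to an action code
    prices: list of closing prices
    """
    ACTION_SIT, ACTION_BUY, ACTION_SELL = 0, 1, 2  # repo's action codes
    inventory, total_profit = [], 0.0
    for t, price in enumerate(prices):
        # Force the first action to be a buy so inventory is never
        # empty for the whole run; afterwards the policy takes over.
        action = ACTION_BUY if t == 0 else act(t)
        if action == ACTION_BUY:
            inventory.append(price)
        elif action == ACTION_SELL and inventory:
            total_profit += price - inventory.pop(0)  # FIFO close-out
    return total_profit

# Toy usage: sell every fifth step, sit otherwise.
print(evaluate(lambda t: 2 if t % 5 == 0 else 0, [10, 11, 12, 13, 14, 15]))
```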

The sigmoid was also overflowing whenever the magnitude of the input x exceeded what math.exp can handle, which on my system was around 700. The replacement implementation is one I found on Stack Overflow.
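For reference, the standard overflow-safe form (one common Stack Overflow variant; the merged code may differ in details) branches on the sign of x so that math.exp is only ever called with a non-positive argument:

```python
import math

def sigmoid(x):
    # math.exp overflows once its argument exceeds roughly 709 on
    # IEEE-754 doubles, so never exponentiate a positive value.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)          # safe: x < 0, so z is in (0, 1)
    return z / (1.0 + z)
```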

alanyuwenche commented 5 years ago

Thanks for sharing.

This modification works around the zero profit caused by a "Buy" never occurring, but I don't think it solves the core problem: why can't a trained agent take the proper (buy) action even on the data it was trained on? I ran into this while building a sell agent (code attached). In the original code the agent must hold a "Buy" position before it can take a "Sell" action; likewise, the code can easily be modified so the agent must open a "Sell" position first. But despite many attempts, my agents only take the "Buy" action, even when I force a sell on the first step. No matter how well they perform during training, that performance does not seem to transfer to evaluation.
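For readers following along, one way the sell-first bookkeeping could look is sketched below; this is an illustration of the idea, and the attached agent_sell.zip may implement it differently. A sell opens a short position and a later buy closes it:

```python
def step_short(action, price, shorts, total_profit):
    """Mirror of the repo's buy-then-sell inventory logic, reversed."""
    ACTION_BUY, ACTION_SELL = 1, 2  # action codes assumed from the repo
    if action == ACTION_SELL:
        shorts.append(price)                    # open a short position
    elif action == ACTION_BUY and shorts:
        total_profit += shorts.pop(0) - price   # profit if price fell
    return total_profit
```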

If we can make this work, the example would actually show how to deal with the "Environment", which is usually quite difficult to model in financial markets. agent_sell.zip