AI4Finance-Foundation / FinRL

FinRL: Financial Reinforcement Learning. 🔥
https://ai4finance.org
MIT License
9.38k stars 2.28k forks source link

Question : DRL_prediciton : if deterministic=True then no actions are taken after the first one #1139

Open robzsaunders opened 6 months ago

robzsaunders commented 6 months ago

So after training my model for 10,000 steps or 10,000,000 steps, when I go to test it with deterministic = true then the model doesn't take any actions.

It'll just pick a random action, spam it 10 times then output 0 until the episode ends.

I'm not entirely sure why, are there others who have faced this issue and overcame it? I have tried with various settings but in my testing/playing around every single trained model does essentially no active trading or decision making after 10 steps.

There is clear progress with my training with rewards increasing and good actions being made

So I guess my question is: What is a good way to ensure that the model will actually take actions when trying to deterministically test?

michaelkawaicheung commented 3 months ago

Also, it seems that actions_memory does not carry the same information as the actions.