AI4Finance-Foundation / FinRL

FinRL: Financial Reinforcement Learning. 🔥
https://ai4finance.org
MIT License
9.65k stars 2.34k forks source link

All episodes SAME REWARD! #713

Open skhonsha opened 2 years ago

skhonsha commented 2 years ago

Dears, Thank you for framework, Please see the output of hyperparameter training on SB3 algorithm, why the reward in all episodes doesnt change, whats the problem?(I copied only three outputs)The reward output is the same in every study for any hyperparameter configuration


| time/ | | | episodes | 172 | | fps | 92 | | time_elapsed | 1014 | | total_timesteps | 93568 | | train/ | | | actor_loss | 5.13 | | critic_loss | 7.43 | | ent_coef | 0.00201 | | learning_rate | 0.0148 | | n_updates | 93447 | | reward | -3.8389 |


| time/ | | | episodes | 176 | | fps | 92 | | time_elapsed | 1039 | | total_timesteps | 95744 | | train/ | | | actor_loss | 4.84 | | critic_loss | 39.8 | | ent_coef | 0.00201 | | learning_rate | 0.0148 | | n_updates | 95623 | | reward | -3.8389 |


| time/ | | | episodes | 180 | | fps | 92 | | time_elapsed | 1062 | | total_timesteps | 97920 | | train/ | | | actor_loss | 4.44 | | critic_loss | 3.66 | | ent_coef | 0.00201 | | learning_rate | 0.0148 | | n_updates | 97799 | | reward | -3.8389 |

YangletLiu commented 2 years ago

@skhonsha Please give more details, since we did not encounter it before. Which code file are you using?

KoperSloper commented 8 months ago

Have you been able to find a solution?