Closed diffunity closed 3 years ago
Thanks for your report! Would be important to increase the reproducibility.
I've noticed this issue too. Why are the results different even when the seed values have been set? Why close the issue without even a conversation around how any improvements can be made? It is a fantastic project but closing issues like this is not the way to improve it.
Stability and reproducibility are two common issues for the RL community, which is really difficult. If you do more experiments, you may not want to get an answer here at this issue. Even if someone shares tuning skills, it may be misleading.
"Reproducibility of DRLs" is the current hot topic in research, I believe it is better to follow the most recent skills from leading groups, say DeepMind, OpenAI, etc.
Ok - thanks for your answer. That is good to know. Cheers
I was running experiments on simple OHLCV features for DDPG algorithm, and I wanted to reproduce the results. I know DRL is not deterministic and I cannot reproduce the results exactly, but is it supposed to be fluctuating so much? The resulting sharpe ratio ranges from 0.1 to 2.7 after numerous and numerous re-runs.
I followed the tutorial exactly - only change I made was on the dates and the "tech_indicator_list" (which I changed to OHLCV). I have tried setting the seeds in all possible areas (random.seed, np.random.seed, torch.manual_seed, stable baseline seeds, gym env seeds, gym env action space seeds, gym dummy vec env seeds)
below are the two really different results I obtained from running the exact same experiment set up
Below here are how I set the seeds