hubbs5 / or-gym

Environments for OR and RL Research
MIT License
373 stars 93 forks source link

Issues with NewsVendor env #31

Open jennafu opened 1 year ago

jennafu commented 1 year ago

I have been attempting to run the NewsVendor environment, with these specific RL and Env configurations (copied from the example notebooks), and it seems like the reward is stuck at around -20,000, after around 500 iterations.

I was wondering what particular configurations I may need to adjust, to see improvements in the reward?

image