Closed ejunprung closed 2 years ago
@ejunprung I could create a second csv on top of the first one in this case. This way you can have both. In principle I could even write an Excel workbook with multiple sheets, but I don't want to exclude users without Office.
What do you think?
@maxpumperla Nice, separate CSVs are perfect. Just one nitpick.
It looks like final rewards are summed together.
I think the reward for each episode should be independent of each other.
@ejunprung have you tried to see if it does the same thing in other simulations?
Not yet. Let me try with Zinc Factory in a bit.
I made a pr to fix the summary table #16
The MC (>1 episodes) seems to output the same information as one episode which isn't so helpful for validating a policy's performance. So I had some ideas, tell me what you think.
Single Episode "Run"
The current output (both console and output csv) is perfect. Good for debugging, no need to change anything here.
Multi-Episode "Run" (i.e. Monte Carlo)
Output only the final metric (i.e. reward) value at the end of each episode. In that way, the user can use Excel, pandas, or whatever tool they prefer to analyze the results and compare to their heuristic.