akataba / rl-repo

Contains different reinforcement learning models

More plot fixes #46

Closed Farquhar13 closed 6 months ago

Farquhar13 commented 7 months ago

@choihrM3 noticed an indexing issue when selecting the last-step fidelities/rewards from an episode. The cause is that, in the scripts, alg_config.build() calls the environment's step function once. That call appends an entry to the environment's transition_history list, so the previous indexing logic was off by one.

This PR uses episode_id to select the fidelity and reward from the last step of each episode for plotting. This should be more robust.
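For illustration only, here is a minimal sketch of the idea, not the actual code in this PR. It assumes each entry of transition_history is a dict with episode_id, fidelity, and reward keys; that layout and the helper name last_step_per_episode are hypothetical. The example also assumes the extra step from alg_config.build() is recorded under the first episode's id.

```python
def last_step_per_episode(transition_history):
    """Return (fidelities, rewards) taken from the final step of each episode.

    Assumes (hypothetically) that every transition is a dict with the keys
    "episode_id", "fidelity", and "reward", stored in chronological order.
    """
    last = {}
    for transition in transition_history:
        # Later entries overwrite earlier ones, so each episode_id ends up
        # mapped to the last transition recorded for that episode.
        last[transition["episode_id"]] = transition
    fidelities = [t["fidelity"] for t in last.values()]
    rewards = [t["reward"] for t in last.values()]
    return fidelities, rewards


# Made-up data: episode 0 has one extra leading step (standing in for the
# step triggered by build()), so a fixed-stride index would be off by one.
history = [
    {"episode_id": 0, "fidelity": 0.10, "reward": 0.1},  # extra step
    {"episode_id": 0, "fidelity": 0.50, "reward": 0.5},
    {"episode_id": 0, "fidelity": 0.90, "reward": 0.9},
    {"episode_id": 1, "fidelity": 0.40, "reward": 0.4},
    {"episode_id": 1, "fidelity": 0.80, "reward": 0.8},
]

fidelities, rewards = last_step_per_episode(history)
```

Because the selection is keyed on episode_id rather than on a position in the list, an extra transition at the start does not shift which step is treated as the last one.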

This PR also removes the try/except code for loading .npy files; to the best of my knowledge, those files are unused.