Closed Olwar closed 1 year ago
The rewards don't seem to be a good metric to evaluate the model. I could have more reward and still have lower profit.
That's true. The reward is more useful as a training signal for the RL agent, but profit is a valid and precise metric for final evaluation.
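To illustrate how the two can disagree, here is a minimal sketch. It assumes the reward is the summed price difference of each closed trade, while profit compounds each trade's sell/buy ratio. Those definitions, the function names, and the trade prices are hypothetical and chosen only for illustration; the environment's actual formulas may differ.

```python
# Toy comparison of an additive reward and a multiplicative profit.
# Both definitions below are assumptions for illustration, not the
# environment's actual implementation.

def total_reward(trades):
    """Sum of absolute price differences over (buy, sell) pairs."""
    return sum(sell - buy for buy, sell in trades)


def total_profit(trades, start=1.0):
    """Compound the relative gain of each (buy, sell) pair."""
    value = start
    for buy, sell in trades:
        value *= sell / buy
    return value


# Hypothetical trades: A doubles a cheap position, B gains 15% on an expensive one.
trades_a = [(10.0, 20.0)]
trades_b = [(100.0, 115.0)]

for name, trades in (("A", trades_a), ("B", trades_b)):
    print(name, "reward:", total_reward(trades), "profit:", total_profit(trades))
```

Under these assumptions, B collects the larger reward because its absolute price move is bigger, yet A ends with the higher profit because its relative gain is bigger.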