Yahiy closed this issue 1 month ago
Thanks for your question and interest in the evaluation part.
[last_image_path, second_last_image_path] -> [last_image_path, this_image_path]. Please refer to this part. I also updated the comments here just now.

Thanks, I get it. It's a step-level evaluator, so identical images mean the task had not succeeded before this step, and this image is not a success either. I thought it was a trajectory-level evaluator.
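For anyone else reading this thread, here is a minimal sketch of the step-level idea as I understand it from the discussion above. It is not the code in evaluate.py: the class name `StepLevelEvaluator`, the `judge` callable, the `same_image` helper, and the byte-for-byte file comparison are all assumptions made for illustration. The point is only that, if the previous step was already judged as not successful, an unchanged screenshot can be marked as not successful without calling the expensive judge again.

```python
import filecmp
from typing import Callable, Optional


def same_image(path_a: str, path_b: str) -> bool:
    """Return True if the two files are byte-for-byte identical.

    Assumption for this sketch: "same image" means identical file contents.
    """
    return filecmp.cmp(path_a, path_b, shallow=False)


class StepLevelEvaluator:
    """Hypothetical step-level evaluator (illustrative, not the DigiRL code)."""

    def __init__(self, judge: Callable[[str, str], bool]) -> None:
        # judge(image_path, task) stands in for the expensive success check,
        # for example a call to a vision-language model.
        self.judge = judge
        self.judge_calls = 0

    def evaluate(
        self,
        last_image_path: Optional[str],
        this_image_path: str,
        task: str,
    ) -> bool:
        # The episode is still running, so the previous step was judged
        # "not success". If the screen did not change, this step cannot be
        # a success either, and the judge call can be skipped.
        if last_image_path is not None and same_image(
            last_image_path, this_image_path
        ):
            return False
        self.judge_calls += 1
        return self.judge(this_image_path, task)
```

A trajectory-level evaluator would instead look at the whole episode once at the end, which is why the same-image shortcut only makes sense when the check runs at every step.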
Closing as the problem is solved.
https://github.com/DigiRL-agent/digirl/blob/d918012ab47c98b2d448b168848c6f6f1936a1e5/digirl/environment/android/evaluate.py#L211
A few questions about the evaluation strategy in the code: