I'm having trouble evaluating my model using the provided checkpoint, and I noticed that the evaluation metrics are different from those reported in the original paper. Is there anything wrong with my setup or is this an inconsistency between the implementation and the paper?
I'm having trouble evaluating my model using the provided checkpoint, and I noticed that the evaluation metrics are different from those reported in the original paper. Is there anything wrong with my setup or is this an inconsistency between the implementation and the paper?