google-research / albert

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
Apache License 2.0
3.24k stars 569 forks

_best_trial_info #66

Closed: dmoham1476 closed this issue 4 years ago

dmoham1476 commented 4 years ago

Why is evaluation run on every checkpoint saved during training? The function _best_trial_info appears to be new in ALBERT; it was not in BERT's run_classifier. I fine-tuned my ALBERT model for 75,000 steps, and it seems to be evaluating every checkpoint saved along the way.
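
A minimal sketch of the behaviour described above, assuming the evaluation loop visits every saved checkpoint in step order and remembers the best score. This is an illustration only, not the ALBERT code: `evaluate_all_checkpoints`, `eval_fn`, the checkpoint names, and the scores are all hypothetical.

```python
from typing import Callable, Iterable, Optional, Tuple


def evaluate_all_checkpoints(
    checkpoints: Iterable[Tuple[int, str]],
    eval_fn: Callable[[str], float],
) -> Tuple[Optional[int], Optional[float], int]:
    """Evaluate every checkpoint in step order and track the best one.

    Hypothetical illustration; not taken from the ALBERT repository.
    Returns (best_step, best_score, number_of_evaluations).
    """
    best_step = None
    best_score = None
    num_evals = 0
    for step, path in sorted(checkpoints):
        score = eval_fn(path)  # one full evaluation pass per checkpoint
        num_evals += 1
        if best_score is None or score > best_score:
            best_step, best_score = step, score
    return best_step, best_score, num_evals


if __name__ == "__main__":
    # Made-up checkpoints and scores, for illustration only.
    fake_scores = {"ckpt-1000": 0.71, "ckpt-2000": 0.78, "ckpt-3000": 0.75}
    ckpts = [(3000, "ckpt-3000"), (1000, "ckpt-1000"), (2000, "ckpt-2000")]
    print(evaluate_all_checkpoints(ckpts, fake_scores.__getitem__))
```

Under this assumption the number of evaluation passes equals the number of saved checkpoints, so a long fine-tuning run with frequent checkpointing would spend a correspondingly long time in evaluation.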