What's the impact of running MLM during training?

alexa / dialoglue

DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue

https://evalai.cloudcv.org/web/challenges/challenge-page/708/overview

Apache License 2.0


What's the impact of running MLM during training? #21

Open shiweii2r opened 1 year ago

shiweii2r commented 1 year ago

Hi, thank you very much for sharing the code! In run.py, the train() function runs the "Run MLM during training" block in every training epoch when args.mlm_during is true. However, that block does not appear to change "model" directly. Is that because of the line "model.bert_model = pre_model.bert_model.bert", so that "model" and "pre_model" share the same weights because both refer to the same object in memory? Thank you very much!
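To make the question concrete, here is a minimal sketch of the aliasing behaviour being asked about. It uses toy pure-Python classes, not the repository's code: `Encoder`, `MLMHead`, `PreModel`, `TaskModel`, and `mlm_step` are all hypothetical stand-ins, and only the attribute path `pre_model.bert_model.bert` mirrors the line quoted above. It shows that a plain attribute assignment binds the same object rather than copying it, so an in-place update made through one wrapper is visible through the other.

```python
class Encoder:
    """Hypothetical stand-in for a BERT encoder: one mutable weight list."""
    def __init__(self):
        self.weights = [0.0, 0.0, 0.0]


class MLMHead:
    """Hypothetical stand-in for an MLM model that wraps the encoder as .bert."""
    def __init__(self, encoder):
        self.bert = encoder


class PreModel:
    """Hypothetical stand-in for pre_model; exposes the MLM model as .bert_model."""
    def __init__(self, encoder):
        self.bert_model = MLMHead(encoder)


class TaskModel:
    """Hypothetical stand-in for the downstream task model."""
    def __init__(self):
        self.bert_model = None


def mlm_step(pre_model, lr=0.1):
    # Pretend optimizer step: updates the encoder weights in place.
    weights = pre_model.bert_model.bert.weights
    for i in range(len(weights)):
        weights[i] -= lr


pre_model = PreModel(Encoder())
model = TaskModel()

# Same shape as the line quoted in the question: binds the object, no copy.
model.bert_model = pre_model.bert_model.bert

before = list(model.bert_model.weights)
mlm_step(pre_model)  # only pre_model is passed in
after = model.bert_model.weights

same_object = model.bert_model is pre_model.bert_model.bert
print(same_object)      # True
print(before != after)  # True: the update is visible through model
```

If run.py follows this pattern with real PyTorch modules, the same reasoning would apply, since assigning an `nn.Module` to an attribute also stores a reference to the same module and its parameters. Whether any later step in the actual code copies or reloads the weights is not something this sketch can confirm.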