If I want to pretrain a model on an MLM (masked language modeling) task, how can these args be used effectively?
parser.add_argument("--mlm", action='store_true', help="Train with masked-language modeling loss instead of language modeling.")
parser.add_argument("--mlm_probability", type=float, default=0.15, help="Ratio of tokens to mask for masked language modeling loss")
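For context, here is a minimal sketch of how two flags like these are typically consumed, assuming BERT-style masking (of the selected tokens, 80% become the mask token, 10% become a random token, 10% stay unchanged). The names `MASK_ID`, `VOCAB_SIZE`, `mask_tokens`, and `prepare_batch` are hypothetical and not taken from the script in question, and this version ignores special tokens and padding, which a real pipeline would normally exclude from masking.

```python
import argparse
import random

MASK_ID = 103         # hypothetical id of the [MASK] token
VOCAB_SIZE = 30522    # hypothetical vocabulary size
IGNORE_INDEX = -100   # label value that cross-entropy losses conventionally skip


def build_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument("--mlm", action="store_true",
                        help="Train with masked-language modeling loss instead of language modeling.")
    parser.add_argument("--mlm_probability", type=float, default=0.15,
                        help="Ratio of tokens to mask for masked language modeling loss")
    return parser


def mask_tokens(token_ids, mlm_probability, rng):
    """Return (inputs, labels) for MLM; only selected positions get a real label."""
    inputs = list(token_ids)
    labels = [IGNORE_INDEX] * len(inputs)
    for i, tok in enumerate(token_ids):
        # Select each position independently with probability mlm_probability.
        if rng.random() >= mlm_probability:
            continue
        labels[i] = tok  # the model must predict the original token here
        roll = rng.random()
        if roll < 0.8:
            inputs[i] = MASK_ID                    # 80%: replace with [MASK]
        elif roll < 0.9:
            inputs[i] = rng.randrange(VOCAB_SIZE)  # 10%: replace with a random token
        # remaining 10%: leave the token unchanged
    return inputs, labels


def prepare_batch(token_ids, args, rng):
    """Pick the objective based on --mlm."""
    if args.mlm:
        return mask_tokens(token_ids, args.mlm_probability, rng)
    # Without --mlm, fall back to plain language modeling: labels mirror the inputs.
    return list(token_ids), list(token_ids)


if __name__ == "__main__":
    args = build_parser().parse_args(["--mlm", "--mlm_probability", "0.15"])
    inputs, labels = prepare_batch([2023, 2003, 1037, 7099, 6251], args, random.Random(0))
    print(inputs, labels)
```

In this sketch `--mlm` only switches the objective, while `--mlm_probability` controls how many positions contribute to the loss; 0.15 is the value used in the original BERT setup, and higher values give more training signal per sequence at the cost of less context for each prediction.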