dhlee347 / pytorchic-bert

PyTorch Implementation of Google BERT
Apache License 2.0
591 stars 179 forks

Why is there any need of max_pred in pretraining? #30

Open wahab4114 opened 3 years ago