huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
https://huggingface.co/transformers
Apache License 2.0

Using whole word masking on training LM from scratch #4577

Closed: uunal closed this issue 3 years ago

uunal commented 4 years ago

❓ Questions & Help

Details

Hello everyone, I want to use whole-word masking when training an LM from scratch, but I could not find out how to apply this option with the Trainer. I thought it would be handled by `DataCollatorForLanguageModeling`, but I could not find a whole-word-masking option there. Am I looking in the wrong place, or is it not implemented yet? If it is not implemented, is it possible to do with run_language_modeling.py?

A link to original question on Stack Overflow: https://stackoverflow.com/questions/62061578/how-to-use-whole-word-masking-on-training-lm-from-scratch

Any help is appreciated! Thanks

usuyama commented 4 years ago

I think it's not implemented yet.

@julien-c any suggestion/thoughts for pretraining with wwm?

usuyama commented 4 years ago

NVIDIA/Megatron-LM does wwm on the fly in `__getitem__`.

We can do something similar in `DataCollatorForLanguageModeling` or in the dataset; see the sketch after the link below.

https://github.com/NVIDIA/Megatron-LM/blob/22c0e300670672e4e0a8604bd6ab89bc28c970a6/megatron/data/bert_dataset.py#L148
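
For reference, a rough sketch of what that grouping-and-masking step could look like on the fly, assuming a WordPiece-style tokenizer where continuation pieces start with `##` (the function name and masking probability are placeholders, not anything from Megatron-LM or transformers):

```python
import random

from transformers import BertTokenizerFast


def whole_word_mask(tokens, mask_prob=0.15, mask_token="[MASK]"):
    """Mask whole words: pieces starting with '##' stay with the preceding word."""
    # Group token indices into word spans.
    word_spans = []
    for i, tok in enumerate(tokens):
        if tok.startswith("##") and word_spans:
            word_spans[-1].append(i)
        else:
            word_spans.append([i])

    masked = list(tokens)
    for span in word_spans:
        if random.random() < mask_prob:
            for i in span:
                masked[i] = mask_token  # mask every piece of the selected word
    return masked


tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
tokens = tokenizer.tokenize("whole word masking keeps subwords together")
print(whole_word_mask(tokens, mask_prob=0.5))
```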

uunal commented 4 years ago

Thanks for the suggestion, I'll look into it.

luffycodes commented 4 years ago

@usuyama The Megatron example is for a BERT dataset, which uses WordPiece tokenization. Any suggestions on how to do wwm with the GPT-2 tokenizer?

usuyama commented 4 years ago

related #6491
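
For the GPT-2 case, one possible direction (a sketch only, not an existing transformers API): GPT-2's byte-level BPE marks a leading space with `Ġ`, so a token starting with `Ġ` begins a new word and word spans can be recovered from the token strings. `gpt2_word_spans` below is a hypothetical helper:

```python
from transformers import GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")


def gpt2_word_spans(token_ids):
    """Group GPT-2 BPE token indices into word spans using the leading-space marker."""
    tokens = tokenizer.convert_ids_to_tokens(token_ids)
    spans = []
    for i, tok in enumerate(tokens):
        # "Ġ" encodes a leading space, so such a token starts a new word;
        # the very first token of a sequence also starts a word.
        if tok.startswith("Ġ") or not spans:
            spans.append([i])
        else:
            spans[-1].append(i)
    return spans


ids = tokenizer("whole word masking with byte-level BPE")["input_ids"]
print(gpt2_word_spans(ids))
# Each inner list is one word; masking would then be applied per span rather than per token.
```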

uunal commented 3 years ago

If you are still looking for an answer, check: https://github.com/huggingface/transformers/blob/07708793f20ec3a949ccab32cc4fe0c7272dcc4c/src/transformers/data/data_collator.py#L301
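
`DataCollatorForWholeWordMask` in transformers implements whole word masking and can be passed straight to `Trainer`. A minimal usage sketch, assuming a WordPiece-style tokenizer such as BERT's (the checkpoint name, toy dataset, and training arguments are placeholders):

```python
from transformers import (
    BertForMaskedLM,
    BertTokenizerFast,
    DataCollatorForWholeWordMask,
    Trainer,
    TrainingArguments,
)

# Placeholder checkpoint; when training from scratch you would instead build the
# model from a config, e.g. BertForMaskedLM(BertConfig(...)).
tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

# Masks every WordPiece of a selected word together, not independent sub-tokens.
data_collator = DataCollatorForWholeWordMask(tokenizer=tokenizer, mlm_probability=0.15)

# Tiny in-memory "dataset" just to keep the sketch self-contained.
texts = ["Whole word masking masks all sub-tokens of a word together."]
train_dataset = [tokenizer(t, truncation=True, max_length=128) for t in texts]

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="wwm-mlm", per_device_train_batch_size=8),
    data_collator=data_collator,
    train_dataset=train_dataset,
)
trainer.train()
```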