VinAIResearch / PhoBERT

PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
MIT License
651 stars 92 forks source link

Code to train model to reproduce the model and evaluate in all experiments #42

Closed dinhanhx closed 2 years ago

dinhanhx commented 2 years ago

In other words, I would like to know the source code corresponding to Section 2 and 3 in the paper.

datquocnguyen commented 2 years ago

See: https://github.com/facebookresearch/fairseq/blob/main/examples/roberta/README.pretraining.md & https://github.com/huggingface/transformers/tree/main/examples