Closed zhengwsh closed 4 years ago
Due to the difference in the implementations of fairseq and HuggingFace's, we have not spent time tuning the hyperparameters for the HuggingFace's version... We will handle the dropout mask for RoBERTa and release it soon!
Can you provide the FreeLB-RoBERTa in HuggingFace's transformers