Closed BinhMinhs10 closed 1 year ago
I am planning to pretrain DeBERTa v3 with RTD and gradient-disentangled embedding sharing, but I don't have any proper references or resources on how to start pretraining it.
I found the documentation, but sadly the pre-training-with-replaced-token-detection-task section is stuck at the "Coming soon..." state.
pre-training-with-replaced-token-detection-task
Coming soon...
updated.