Thank you very much for your work, could you please share your pre-training data set, I would like to use some longer max_length model to replace codebert to handle longer input data, in the meantime, can you briefly explain the steps of the pre-training?
Thank you very much for your work, could you please share your pre-training data set, I would like to use some longer max_length model to replace codebert to handle longer input data, in the meantime, can you briefly explain the steps of the pre-training?