microsoft / CodeXGLUE

CodeXGLUE
MIT License
1.57k stars 366 forks source link

run on sample with sequence longer than 500 #77

Closed unclebob7 closed 3 years ago

unclebob7 commented 3 years ago

Hi I have tried to run defect detection on the codebert model according to the repo. However, I was trying to apply it on my own dataset with sequence length larger than 500, the model fail to run. I wonder is there any solution for that?

guoday commented 3 years ago

No. The model only support max length of 512. You can split them, feed the CodeBERT and merge.