microsoft / CodeXGLUE

CodeXGLUE
MIT License
1.56k stars 366 forks source link

Code-Code / Clone-detection-BigCloneBench : How does a 800-length input fit into the model? #69

Closed skkuai closed 3 years ago

skkuai commented 3 years ago

Hello. Thank you for your work.

I'm testing your clone detection code. The maximum input length for the encoder in the model is 512. By the way, the block_size is 400, so the sequence length of input is 800, why does the code run without an error?

I looked at the source code document for the Roberta model, but am I missing something?

guoday commented 3 years ago

Two codes are fed into the model separately. So the length of input is 400 instead of 800.

skkuai commented 3 years ago

I got it. Thanks!