Open isunli opened 4 years ago
I'm not sure I understand this correctly. According to the BERT paper, the authors use the first vector (the one for the "[CLS]" token) for classification. I see that your code uses the "pooled" vector instead. Is there a reason for that?
Thanks, Li Sun