nlpxucan / ZRKGC


special tokens of "<#Q2K#>" #1

Open jind11 opened 3 years ago

jind11 commented 3 years ago

Hi, I tried to run this code but ran into a problem: after BERT tokenization, the input contains several special tokens such as "<#Q2K#>", "<#K#>", and "<#Q#>". The original BERT vocab does not contain these three tokens, which causes an indexing error when the tokens are converted to ids. How did you solve this issue? Thanks!
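For reference, here is a minimal sketch of the failure and of one workaround I could imagine. It is not necessarily what this repo does. It uses a toy dictionary in place of the real `vocab.txt`, and the helper names (`missing_tokens`, `remap_unused_slots`) are hypothetical ones made up for illustration. The workaround assumes the vocab has `[unused0]`, `[unused1]`, ... placeholder entries, as the original BERT vocab does, and reassigns their ids to the three special tokens.

```python
# Toy stand-in for a BERT vocab; the real vocab.txt has roughly 30k entries.
vocab_tokens = ["[PAD]", "[unused0]", "[unused1]", "[unused2]",
                "[UNK]", "[CLS]", "[SEP]", "hello", "world"]
vocab = {tok: idx for idx, tok in enumerate(vocab_tokens)}

# The special tokens named in this issue.
special_tokens = ["<#Q2K#>", "<#K#>", "<#Q#>"]


def convert_tokens_to_ids(tokens, vocab):
    """Plain dictionary lookup; raises KeyError for any token not in vocab."""
    return [vocab[tok] for tok in tokens]


def missing_tokens(tokens, vocab):
    """Hypothetical helper: list the tokens that would fail the lookup."""
    return [tok for tok in tokens if tok not in vocab]


def remap_unused_slots(vocab, new_tokens):
    """Hypothetical helper: give each new token the id of an [unusedN] slot.

    Assumption: the vocab contains [unused0] .. [unused{len(new_tokens)-1}].
    The vocab size does not change, so embeddings need no resizing.
    """
    patched = dict(vocab)
    for i, tok in enumerate(new_tokens):
        patched[tok] = patched.pop(f"[unused{i}]")
    return patched


tokens = ["[CLS]", "<#Q#>", "hello", "<#K#>", "world", "[SEP]"]

# With the unmodified vocab, the special tokens are reported as missing.
print(missing_tokens(tokens, vocab))

# After remapping, every token resolves to an id.
patched_vocab = remap_unused_slots(vocab, special_tokens)
print(convert_tokens_to_ids(tokens, patched_vocab))
```

With a real vocab file, the equivalent change would be to overwrite three `[unused…]` lines in `vocab.txt` with the special tokens, but I do not know whether that matches the vocab the model was trained with.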