Why get_special_tokens_mask appending a [1] at the end while build_inputs_with_special_tokens does not append a [SEP] at the end for a single input sequence ?

salesforce / ALBEF

Code for ALBEF: a new vision-language pre-training method

BSD 3-Clause "New" or "Revised" License

1.57k stars 199 forks source link

Open zhihuacc opened 2 years ago

zhihuacc commented 2 years ago

LiJunnan1992 commented 2 years ago

We remove [SEP] for a single sentence input because it has negligible effect on pre-training.

zhihuacc commented 2 years ago

But why get_special_tokens_mask still appends a [1]. I thought this [1] is for [SEP], right ?

LiJunnan1992 commented 2 years ago

Yes you are right, I have modified the code so that the [1] is not appended. Thank you!