Open Neo-Zhangjiajie opened 2 years ago
Could you please check the version of the transformers package? With transformers=4.2.1
(mentioned in requirements.txt), both tokenizers yield the same results:
I <extra_id_0> you.
[27, 32099, 25, 5, 1]
I<extra_id_0> you.</s>
['▁']
When I use VLT5TokenizerFast to encode the sentence, there will be a token id 3 ( '▁') before id of token. For example,
`
If I just use T5tokenizerFast, it is ok, and the output is
Is there any solution? Thanks!