Closed tjohnson31415 closed 1 month ago
Generate respects add_special_tokens, but Tokenize does not. As a result, when ADD_SPECIAL_TOKENS=False, the tokens and token count returned by Tokenize will differ from what Generate actually uses.
Let's make sure to add a unit test and integration test for it if applicable!
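As a starting point for the unit test, here is a minimal sketch of the parity check. Everything in it is hypothetical and stands in for the real code: `ToyTokenizer`, `read_add_special_tokens`, `generate_input_tokens`, `tokenize_buggy`, and `tokenize_fixed` are illustrative names, and the assumption that the setting defaults to true when unset is mine, not taken from the server.

```python
import os

BOS, EOS = "<s>", "</s>"


class ToyTokenizer:
    """Hypothetical stand-in for the real tokenizer; splits on whitespace."""

    def encode(self, text, add_special_tokens=True):
        tokens = text.split()
        if add_special_tokens:
            # Wrap the sequence the way many tokenizers add BOS/EOS markers.
            tokens = [BOS, *tokens, EOS]
        return tokens


def read_add_special_tokens(env=None):
    """Read ADD_SPECIAL_TOKENS from a mapping (assumed default: true)."""
    env = os.environ if env is None else env
    return env.get("ADD_SPECIAL_TOKENS", "true").lower() != "false"


def generate_input_tokens(tokenizer, text, add_special_tokens):
    """Models the Generate path, which respects the setting."""
    return tokenizer.encode(text, add_special_tokens=add_special_tokens)


def tokenize_buggy(tokenizer, text, add_special_tokens):
    """Models the reported behavior: the setting is ignored."""
    return tokenizer.encode(text)


def tokenize_fixed(tokenizer, text, add_special_tokens):
    """Models the desired behavior: same setting as Generate."""
    return tokenizer.encode(text, add_special_tokens=add_special_tokens)


def test_tokenize_matches_generate():
    flag = read_add_special_tokens({"ADD_SPECIAL_TOKENS": "False"})
    tokenizer = ToyTokenizer()
    expected = generate_input_tokens(tokenizer, "hello world", flag)
    # The buggy path counts the extra BOS/EOS tokens.
    assert len(tokenize_buggy(tokenizer, "hello world", flag)) != len(expected)
    # The fixed path matches what Generate will actually consume.
    assert tokenize_fixed(tokenizer, "hello world", flag) == expected
```

An integration test could follow the same shape: start the server with ADD_SPECIAL_TOKENS=False, send the same text to Tokenize and Generate, and compare the reported input token counts.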