Open abuelnasr0 opened 3 months ago
I opened this PR instead of keras-team/keras-nlp#1447. This PR:
special_tokens_in_strings
<s>
</s>
I also renamed unsplittable_tokens to special_tokens to be similar to other tokenizers. not sure if it's necessary.
unsplittable_tokens
special_tokens
I opened this PR instead of keras-team/keras-nlp#1447. This PR:
special_tokens_in_strings
Arg to byte_pair_tokenizer.<s>
and</s>
to the same id.I also renamed
unsplittable_tokens
tospecial_tokens
to be similar to other tokenizers. not sure if it's necessary.