Open yucc-leon opened 4 months ago
I found in this repo and the huggingface model card there is a line:
# tokenizer.eos_token_id is the id of <|EOT|> token
But in tokenizer_config.py inside model repo the eos_token is set to be <|end▁of▁sentence|>:
eos_token
<|end▁of▁sentence|>
"eos_token": { "__type": "AddedToken", "content": "<|end▁of▁sentence|>", "lstrip": false, "normalized": true, "rstrip": false, "single_word": false }
which is correct?
I found in this repo and the huggingface model card there is a line:
But in tokenizer_config.py inside model repo the
eos_token
is set to be<|end▁of▁sentence|>
:which is correct?