Closed G07cha closed 4 months ago
The eos_token_id
of deepseek-coder-7b-instruct v1.5
is 100015, because the vocabulary is different from v1.
@guoday do you know what value eos_token_id
of v1.5 model should be set to when support for code completion is needed?
Are you referring to deepseek-coder-base-v1.5? The eos_token_id
is 100001. However, when you use the coder for code completion, you shouldn't be able to stop generation using the eos_token_id
. Typically, in code completion, you need to specify when to stop generation yourself, such as generating one line, generating a block, or generating 64 tokens.
Ah clear, couldn't make the connection between eos_token_id
from base model and instruct one, thanks!
A rookie question, I was following README's guide on setting up deepseek-coder-instruct to complete the code and wanted to try it with deepseek-coder-7b-instruct-v1.5, but there
eos_token_id
is set to100015
unlike32021
for regulardeepseek-coder-7b-instruct
model. Caneos_token_id: 32014
still used for v1.5 model?