Open HermitSun opened 1 year ago
It's because the new vLLM doesn't decode special tokens. We've fixed it by replacing the stop token, can you try again?
Thank you for your reply. Could you tell me what's the stop token now?
Or furthermore, is there any example of calling OpenCoderPlus correctly?
Thanks in advance.
I tried to launch OpenCoderPlus with the latest code of this repo and vLLM:
It can work, but the outputs will never stop util hitting the
max_tokens
limit, even if I pass thestop
parameter:I refered to OpenCoderPlus's training data, it seems that this model is training on data with the
<|end_of_turn|>
character.So does anyone know how to stop this model's outputs? Any help will be appreciated.