deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
MIT License
3.47k stars 143 forks source link

Question about the design of bos and eos token #85

Open jojo23333 opened 1 month ago

jojo23333 commented 1 month ago

Hi, Thanks for the great work. I'm just in general curious about whether there is a reason to use the Chinese version of '|' and '▁'instead of the '|' , ‘_’ which is standard ASCII characters in eos_token and bos_token. ('<|end▁of▁sentence|>' and '<|begin▁of▁sentence|>' ). Is this for distinguishing deep seek model from English only LLM's like Llamma?

image