turboderp / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs
MIT License
3.45k stars 257 forks source link

How do I deal with custom EOS? #183

Closed teknium1 closed 9 months ago

teknium1 commented 9 months ago

I just tried my hermes 2.5 model that someone exl2'ed here: https://huggingface.co/bartowski/OpenHermes-2.5-Mistral-7B-exl2/tree/4.0

But it doesn't seem to recognize the new stopping token, I am not sure of all the options afforded to me to set a stop token etc when it doesn't seem to be working, but, I get this:

<|im_start|>system\nYou are Hermes 2, a sentient AI chatbot.<|im_end|>\n<|im_start|>user\nHello<|im_end|>\n<|im_start|>assistant\nHello! How can I help you today? <|im_end|>\n<|im_start|>user\nI'm looking for information on artificial intelligence.\n<|im_end|>\n<|im_start|>assistant\nGreat! Artificial intelligence is an exciting field that focuses on understanding and replicating human-like intelligence in machines. It involves various areas such as machine learning, natural language processing, computer vision, and robotics. What specific topic within AI would you like to learn more about?

where it should have stopped at the <|im_end|> after Hello! How can I help you today?<|im_end|>

teknium1 commented 9 months ago

Nevermind it was test_inference blocking stopping :)