Closed xdevfaheem closed 1 year ago
This looks like typical LLM behavior to me. If something were broken, you would see complete gibberish made of random tokens, or a single token repeating infinitely.
I suggest implementing a repetition penalty (presence penalty) to reduce repetitions; there is an example in chat_with_bot.py.
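As a rough illustration of the penalty idea, here is a minimal sketch in plain Python. It is not the code from chat_with_bot.py; the function name, parameter names, and default values are assumptions made for this example. It lowers the logit of every token that has already been generated, by a flat amount (presence) plus an amount proportional to how often the token appeared (frequency).

```python
from collections import Counter


def apply_repetition_penalties(logits, generated_tokens,
                               presence_penalty=0.2, frequency_penalty=0.2):
    """Return a copy of `logits` with already-generated tokens penalized.

    `logits` is a list of floats indexed by token id, and `generated_tokens`
    is the list of token ids produced so far. Names and defaults here are
    hypothetical, chosen only for this sketch.
    """
    counts = Counter(generated_tokens)
    penalized = list(logits)  # copy, so the caller's logits stay untouched
    for token_id, count in counts.items():
        # Flat penalty for having appeared at all, plus a per-occurrence penalty.
        penalized[token_id] -= presence_penalty + frequency_penalty * count
    return penalized
```

The penalized logits would be computed before sampling each new token, so tokens the model keeps repeating become progressively less likely.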
Other suggestions:
- for Raven models, use the prompt format recommended by BlinkDL; here are example prompts that should work
- use 14B instead of 7B
- use input formatting that is less confusing for the model:
  Write a Poem About AI -> Write a poem about AI
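The last suggestion can be applied by hand, but a small helper makes it repeatable. The sketch below is only a heuristic, and the function name is made up for this example: it lowercases every word except the first and leaves all-caps words such as "AI" alone. It will also lowercase proper nouns, so check the result before sending it to the model.

```python
def to_sentence_case(text):
    """Convert a Title Case instruction to sentence case (heuristic sketch)."""
    result = []
    for index, word in enumerate(text.split()):
        if word.isupper():
            # Keep all-caps words (acronyms such as "AI", or "I") unchanged.
            result.append(word)
        elif index == 0:
            # Capitalize only the first word of the instruction.
            result.append(word[:1].upper() + word[1:].lower())
        else:
            result.append(word.lower())
    return " ".join(result)


print(to_sentence_case("Write a Poem About AI"))  # Write a poem about AI
```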
Oh... so it doesn't look like gibberish to you, mate.
I was quite excited to run RWKV 7B on rwkv.cpp when I found out about it, but after hours of setting it up, I got this output. I expected RWKV to give good results.
So how should I get good results with faster inference? I'll be sure to check your advice above.
By the way, I'm using the prompt from the official RWKV Hugging Face Space:
def generate_prompt(instruction, input=None):
    # Normalize line endings and collapse blank lines.
    instruction = instruction.strip().replace('\r\n', '\n').replace('\n\n', '\n')
    # Guard against input=None, which would otherwise raise on .strip().
    input = input.strip().replace('\r\n', '\n').replace('\n\n', '\n') if input else ''
    if input:
        return f"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

# Instruction:
{instruction}

# Input:
{input}

# Response:
"""
    else:
        return f"""Below is an instruction that describes a task. Write a response that appropriately completes the request.

# Instruction:
{instruction}

# Response:
"""
I'm using the https://huggingface.co/xzuyn/RWKV-4-Raven-7B-v11x-Eng99-Other1-20230429-ctx8192-ggml-q5_1 ggml weights with modified rwkv.cpp inference scripts, which is
I expected it to generate good outputs, but I got the following outputs
I tried multiple times with multiple prompts, but had no luck.
So is there anything I can do to get good outputs, or what is the problem here?